
co no my 
Edition 


| ^MpW 

« Iffftu 


Theory of 

Computer 

Science 

1 

Automata, Languages and Computation 1 


Third Edition 



K.L.P. Mishra 
N. Chandrasekaran 












THEORY OF COMPUTER SCIENCE 

Automata, Languages and Computation 

THIRD EDITION 


K.L.P. MISHRA 

Formerly Professor 

Department of Electrical and Electronics Engineering 
and Principal, Regional Engineering College 
Tiruchirapalli 

N. CHANDRASEKARAN 

Professor 

Department of Mathematics 
St. Joseph's College 
Tiruchirapalli 


Prentice'Hall of India Potato MmifeP 

New Delhi - 110 001 
2008 


http://engineeringbooks.net 



Contents 



Preface ix 

Notations xi 


1. PROPOSITIONS AND PREDICATES 1-35 

1.1 Propositions (or Statements) 1 

Ll.l Connectives (Propositional Connectives 
or Logical Connectives) 2 

1.1.2 Well-formed Formulas 6 

1.1.3 Truth Table for a Well-formed Formula 7 

1.1.4 Equivalence of Well-formed Formulas 9 

1.1.5 Logical Identities 9 

1.2 Normal Forms of Well-formed Formulas 11 

1.2.1 Construction to Obtain a Disjunctive Normal 
Form of a Given Formula 11 

1.2.2 Construction to Obtain the Principal 

Disjunctive Normal Form of a Given Formula 12 

1.3 Rules of Inference for Propositional Calculus 
(Statement Calculus) 15 

1.4 Predicate Calculus 19 

1.4.1 Predicates 19 

1.4.2 Well-formed Formulas of Predicate Calculus 21 

1.5 Rules of Inference for Predicate Calculus 23 

1.6 Supplementary Examples 26 

Self-Test 31 

Exercises 32 


http://engineeringbooks.net 



iv B Contents 


2. MATHEMATICAL PRELIMINARIES 36-70 

2.1 Sets, Relations and Functions 36 

2.1.1 Sets and Subsets 36 

2.1.2 Sets with One Binary Operation 37 

2.1.3 Sets with Two Binary Operations 39 

2.1.4 Relations 40 

2.1.5 Closure of Relations 43 

2.1.6 Functions 45 

2.2 Graphs and Trees 47 

2.2.1 Graphs 47 

2.2.2 Trees 49 

2.3 Strings and Their Properties 54 

2.3.1 Operations on Strings 54 

2.3.2 Terminal and Nonterminal Symbols 56 

2.4 Principle of Induction 57 

2.4.1 Method of Proof by Induction 57 

2.4.2 Modified Method of Induction 58 

2.4.3 Simultaneous Induction 60 

2.5 Proof by Contradiction 61 

2.6 Supplementary Examples 62 

Self-Test 66 

Exercises 67 

3. THE THEORY OF AUTOMATA 71-106 

3.1 Definition of an Automaton 71 

3.2 Description of a Finite Automaton 73 

3.3 Transition Systems 74 

3.4 Properties of Transition Functions 75 

3.5 Acceptability of a String by a Finite Automaton 77 

3.6 Nondeterministic Finite State Machines 78 

3.7 The Equivalence of DFA and NDFA 80 

3.8 Mealy and Moore Models 84 

3.8.1 Finite Automata with Outputs 84 

3.8.2 Procedure for Transforming a Mealy Machine 

into a Moore Machine 85 

3.8.3 Procedure for Transforming a Moore Machine 

into a Mealy Machine 87 

3.9 Minimization of Finite Automata 91 

3.9.1 Construction of Minimum Automaton 92 

3.10 Supplementary Examples 97 

Self-Test 103 

Exercises 104 


http://engineeringbooks.net 


Contents B v 


4. FORMAL LANGUAGES 107-135 

4.1 Basic Definitions and Examples 107 

4.1.1 Definition of a Grammar 109 

4.1.2 Derivations and the Language Generated by a 
Grammar 110 

4.2 Chomsky Classification of Languages 120 

4.3 Languages and Their Relation 123 

4.4 Recursive and Recursively Enumerable Sets 124 

4.5 Operations on Languages 126 

4.6 Languages and Automata 128 

4.7 Supplementary Examples 129 

Self-Test 132 

Exercises 134 

5. REGULAR SETS AND REGULAR GRAMMARS 136-179 

5.1 Regular Expressions 136 

5.1.1 Identities for Regular Expressions 138 

5.2 Finite Automata and Regular Expressions 140 

5.2.1 Transition System Containing A-moves 140 

5.2.2 NDFAs with A-moves and Regular Expressions 142 

5.2.3 Conversion of Nondeterministic Systems to 
Deterministic Systems 146 

5.2.4 Algebraic Method Using Arden’s Theorem 148 

5.2.5 Construction of Finite Automata Equivalent 

to a Regular Expression 153 

5.2.6 Equivalence of Two Finite Automata 157 

5.2.7 Equivalence of Two Regular Expressions 160 

5.3 Pumping Lemma for Regular Sets 162 

5.4 Application of Pumping Lemma 163 

5.5 Closure Properties of Regular Sets 165 

5.6 Regular Sets and Regular Grammars 167 

5.6.1 Construction of a Regular Grammar Generating % 

T(M) for a Given DFA M 168 

5.6.2 Construction of a Transition System M Accepting 
L(G ) for a Given Regular Grammar G 169 

5.7 Supplementary Examples 170 

Self-Test 175 

Exercises 176 

6. CONTEXT-FREE LANGUAGES 180-226 

6.1 Context-free Languages and Derivation Trees 180 

6.1.1 Derivation Trees 181 

6.2 Ambiguity in Context-free Grammars 188 


http://engineeringbooks.net 



vi fl Contents 


6.3 Simplification of Context-free Grammars 189 

6.3.1 Construction of Reduced Grammars 190 

6.3.2 Elimination of Null Productions 196 

6.3.3 Elimination of Unit Productions 199 

6.4 Normal Forms for Context-free Grammars 201 

6.4.1 Chomsky Normal Form 201 

6.4.2 Greibach Normal Form 206 

6.5 Pumping Lemma for Context-free Languages 213 

6.6 Decision Algorithms for Context-free Languages 217 

6.7 Supplementary Examples 218 

Self-Test 223 

Exercises 224 

7. PUSHDOWN AUTOMATA 227-266 

7.1 Basic Definitions 227 

7.2 Acceptance by pda 233 

7.3 Pushdown Automata and Context-free Languages 240 
7 A Parsing and Pushdown Automata 251 

7.4.1 Top-down Parsing 252 

7.4.2 Top-down Parsing Using Deterministic pda's 256 

7.4.3 Bottom-up Parsing 258 

7.5 Supplementary 7 Examples 260 

Self-Test 264 

Exercises 265 

8. LR(k) GRAMMARS 

8.1 LR(k) Grammars 267 

8.2 Properties of LR(£) Grammars 270 

8.3 Closure Properties of Languages 272 

8.4 Supplementary Examples 272 

Self-Test 273 

Exercises 274 

9. TURING MACHINES AND LINEAR BOUNDED 

AUTOMATA 277-308 

9.1 Turing Machine Model 278 

9.2 Representation of Turing Machines 279 

9.2.1 Representation by Instantaneous Descriptions 279 

9.2.2 Representation by Transition Table 280 

9.2.3 Representation by Transition Diagram 281 

9.3 Language Acceptability by Turing Machines 283 

9.4 Design of Turing Machines 284 

9.5 Description of Turing Machines 289 


267-276 


% 


http://engineeringbooks.net 


Contents B vii 


9.6 Techniques for TM Construction 289 

9.6.1 Turing Machine with Stationary Head 289 

9.6.2 Storage in the State 290 

9.6.3 Multiple Track Turing Machine 290 

9.6.4 Subroutines 290 

9.7 Variants of Turing Machines 292 

9.7.1 Multitape Turing Machines 292 

9.7.2 Nondeterministic Turing Machines 295 

9.8 The Model of Linear Bounded Automaton 297 

9.8.1 Relation Between LBA and Context-sensitive 
Languages 299 

9.9 Turing Machines and Type 0 Grammars 299 

9.9.1 Construction of a Grammar Corresponding to TM 299 

9.10 Linear Bounded Automata and Languages 301 

9.11 Supplementary Examples 303 

Self-Test 307 

Exercises 308 

10. DECIDABILITY AND RECURSIVELY ENUMERABLE 

LANGUAGES 309-321 

10.1 The Definition of an Algorithm 309 

10.2 Decidability 310 

10.3 Decidable Languages 311 

10.4 Undecidable Languages 313 

10.5 Halting Problem of Turing Machine 314 

10.6 The Post Correspondence Problem 315 

10.7 Supplementary Examples 317 

Self-Test 319 

Exercises 319 

11. COMPUTABILITY 322-345 

11.1 Introduction and Basic Concepts 322 

11.2 Primitive Recursive Functions 323 

11.2.1 Initial Functions 323 

11.2.2 Primitive Recursive Functions Over N 325 

11.2.3 Primitive Recursive Functions Over { a , £>} 327 

11.3 Recursive Functions 329 

11.4 Partial Recursive Functions and Turing Machines 332 

11.4.1 Computability 332 

11.4.2 A Turing Model for Computation 333 

11.4.3 Turing-computable Functions 333 

11.4.4 Construction of the Turing Machine That 
Can Compute the Zero Function Z 334 

11.4.5 Construction of the Turing Machine for Computing— 
The Successor Function 335 


http://engineeringbooks.net 



viii S Contents 


11.4.6 Construction of the Turing Machine for Computing 
the Projection Uf 336 

11.4.7 Construction of the Turing Machine That Can 
Perform Composition 338 

11.4.8 Construction of the Turing Machine That Can 
Perform Recursion 339 

11.4.9 Construction of the Turing Machine That Can Perform 
Minimization 340 

11.5 Supplementary Examples 340 
Self-Test 342 
Exercises 343 


12. COMPLEXITY 346-371 

12.1 Growth Rate of Functions 346 

12.2 The Classes P and NP 349 

12.3 Polynomial Time Reduction and NP-completeness 351 

12.4 Importance of AT^complete Problems 352 

12.5 SAT is AP-complete 353 

12.5.1 Boolean Expressions 353 

12.5.2 Coding a Boolean Expression 353 

12.5.3 Cook’s Theorem 354 

12.6 Other /VP-complete Problems 359 

12.7 Use of AP-completeness 360 

12.8 Quantum Computation 360 

12.8.1 Quantum Computers 361 

12.8.2 Church-Turing Thesis 362 

12.8.3 Power of Quantum Computation 363 

12.8.4 Conclusion 364 

12.9 Supplementary Examples 365 

Self-Test 369 

Exercises 370 


Answers to Self-Tests 373-374 

Solutions (or Hints) to Chapter-end Exercises 375-415 

Further Reading 417- 418 

Index 419-422 


http://engineeringbooks.net 



Preface 


The enlarged third edition of Theory of Computer Science is the result of the 
enthusiastic reception given to earlier editions of this book and the feedback 
received from the students and teachers who used the second edition for 
several years. 

The new edition deals with all aspects of theoretical computer science, 
namely automata, formal languages, computability and complexity. Very 
few books combine all these theories and give adequate examples. This book 
provides numerous examples that illustrate the basic concepts. It is profusely 
illustrated with diagrams. While dealing with theorems and algorithms, the 
emphasis is on constructions. Each construction is immediately followed by an 
example and only then the formal proof is given so that the student can master 
the technique involved in the construction before taking up the formal proof. 

The key feature of the book that sets it apart from other books is the 
provision of detailed solutions (at the end of the book) to chapter-end 
exercises. 

The chapter on Propositions and Predicates (Chapter 10 of the second 
edition) is now the first chapter in the new edition. The changes in other 
chapters have been made without affecting the structure of the second edition. 
The chapter on Turing machines (Chapter 7 of the second edition) has 
undergone major changes. 

A novel feature of the third edition is the addition of objective type 
questions in each chapter under the heading Self-Test. This provides an 
opportunity to the student to test whether he has fully grasped the fundamental 
concepts. Besides, a total number of 83 additional solved examples have been 
added as Supplementary Examples which enhance the variety of problems 
dealt with in the book. 
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x B Preface 


The sections on pigeonhole principle and the principle of induction (both 
in Chapter 2) have been expanded. In Chapter 5, a rigorous proof of Kleene’s 
theorem has been included. The chapter on LR(fc) grammars remains the same 
Chapter 8 as in the second edition. 

Chapter 9 focuses on the treatment of Turing machines (TMs). A new 
section on high-level description of TM has been added and this is used in later 
examples and proofs. Some techniques for the construction of TMs have been 
added in Section 9.6. The multitape Turing machine and the nondeterministic 
Turing machine are discussed in Section 9.7. 

A new chapter (Chapter 10) on decidability and recursively enumerable 
languages is included in this third edition. In the previous edition only a 
sketchy introduction to these concepts was given. Some examples of 
recursively enumerable languages are given in Section 10.3 and undecidable 
languages are discussed in Section 10.4. The halting problem of TM is 
discussed in Section 10.5. Chapter 11 on computability is Chapter 9 of the 
previous edition without changes. 

Chapter 12 is a new chapter on complexity theory and /VP-complete 
problems. Cook’s theorem is proved in detail. A section on Quantum 
Computation is added as the last section in this chapter. Although this topic 
does not fall under the purview of theoretical computer science, this section 
is added with a view to indicating how the success of Quantum Computers will 
lead to dramatic changes in complexity theory in the future. 

The book fulfils the curriculum needs of undergraduate and postgraduate 
students of computer science and engineering as well as those of MCA courses. 
Though designed for a one-year course, the book can be used as a one- 
semester text by a judicious choice of the topics presented. 

Special thanks go to all the teachers and students who patronized this book 
over the years and offered helpful suggestions that have led to this new 
edition. In particular, the critical comments of Prof. M, Umaparvathi, 
Professor of Mathematics, Seethalakshmi College, Tiruchirapalli are gratefully 
acknowledged. 

Finally, the receipt of suggestions, comments and error reports for further 
improvement of the book would be welcomed and duly acknowledged. 

K.L.P. Mishra 
N. Chandrasekran 
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Notations 



Symbol 

Meaning 

Section in which the 
symbol appears first 
and is explained 

T 

Truth value 

1.1 

F 

False value 

1.1 

-i 

The logical connective NOT 

1.1 

A 

The logical connective AND 

1.1 

V 

The logical connective OR 

1.1 

=> 

The logical connective IF . . . THEN .. . 

1.1 

<=> 

The logical connective If and Only If 

1.1 

T 

Any tautology 

1.1 

F 

Any contradiction 

1.1 

V 

For every r 

1.4 

j 

There exists 

1.4 

= 

Equivalence of predicate formulas 

1.4 

Cl E A 

The element a belongs to the set A 

2.1.1 

A c S 

The set A is a subset of set B 

2.1.1 

0 

The null set 

2.1.1 

A u B 

The union of the sets A and B 

2.1.1 

A u B 

The intersection of the sets A and B 

2.1.1 

A - £ 

The complement of B in A 

2.1.1 

A r 

The complement of A 

2.1.1 

2 4 

The power set of A 

2.1.1 

A x £ 

The cartesian product of A and B 

2.1.1 


X I 
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Symbol 

Meaning 

Section in which the 
symbol appears first 
and is explained 


n 

U A, 

1=1 

The union of the sets A h A 2 , . . A n 

2.1.2 

*, 0 

Binar>- operations 

2.1.2, 2. 

xRy 

x is related to y under the relation 

2.1.4 

xR'y 

* is not related to y under the relation R 

2.1.4 

i = j modulo n 

i is congruent to j modulo n 

2.1.4 

C a 

The equivalence class containing a 

2.1.4 

R + 

The transitive closure of R 

2.1.5 

R* 

The reflexive-transitive closure of R 

2.1.5 

R\ 0 R 2 

The composite of the relations R\ and R 2 

2.1.5 

f: X -* Y 

Map/fimction from a set X to a set Y 

2.1.6 

fix) 

The image of x under / 

2.1.6 

Del 

The smallest integer > x 

2.2.2 

z* 

The set of all strings over the alphabet set I 

2.3 

A 

The empty string 

2.3 

X + 

The set of all nonempty strings over X 

2.3 

M 

The length of the string x 

2.3 

(l Q , 2. s, qo, F) 

A finite automaton 

3.2 

<r 

Left endmarker in input tape 

3.2 

$ 

Right endmarker in input tape 

3.2 

(2, 2, & flo, F) 

A transition system 

3.3 

(2, 2, A, 5, K qo) 

A Mealy/Moore machine 

3.8 

K 

Partition corresponding to equivalence of states 

3.9 


Partition corresponding to ^-equivalence of states 

3.9 

(V N , 2, P, S ) 

A grammar 

4.1.1 

a => P 

G 

a directly derives /3 in grammar G 

4.1.2 

a=> 0 

a derives p in grammar G 

4.1.2 

G 

a derives p in n steps in grammar G 

4.1.2 

L(G) 

The language generated by G 

4.1.2 

^0 

The family of type 0 languages 

4.3 

=^csl 

The family of context-sensitive languages 

4.3 

^cfl 

The family of context-free languages 

4.3 

4l 

The family of regular languages 

4.3 

Ri + R 2 

The union of regular expressions and R 2 

5.1 

RiR 2 

The concatenation of regular expressions Ri and R 2 5.1 

R* 

The iteration (closure) of R 

5.1 
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Symbol 

Meaning 

Section in which the 
symbol appears first 
and is explained 

a 

The regular expression corresponding to [a] 

5.1 

IKvl 

The number of elements in V N 

6.3 

(Q, i, r, 8, 

<?o> 2o. F ) 

A pushdown automaton 

7.1 

ID 

An instantaneous description of a pda 

7.1 

1 A 

A move relation in a pda A 

7.1 

(«2, I, r, 8 q 0 , 8 F) 

A Turing machine 

9.1 

p- 

The move relation in a TM 

9.2.2 

Tin) 

Maximum of the running time of M 

9.7.1 

^DFA 

Sets 

10.3 

A:t-g 

Sets 

10.3 

A:;sg 

Sets 

10.3 

^tm 

Sets 

10.4 

HALTj M 

Sets 

10.5 

Z(x) 

The image of x under zero function 

11.2 

S(x) 

The image of x under successor function 

11.2 

ur 

The projection function 

11.2 

nil (. x) 

The image of x under nil function 

11.2 

cons a(x) 

The concatenation of a and x 

11.2 

cons b(x) 

The concatenation of b and x 

11.2 

p(x) 

The image of x under predecessor function 

11.2 

- 

The proper subtraction function 

Example 11.8 

|-V| 

The absolute value of x 

11.2 

id 

Identity function 

11.2 


The minimization function 

11.3 

Za 

The characteristic function of the set A 

Exercise 11.8 

0(g(n)) 

The order of g(ji) 

12.1 


The order of n k 

12.1 

P and NP 

Classes 

12.2 

l^> 

quantum bit (qubit) 

12.8.1 

PSPACE 

Classes 

12.8.3 

EXP 

Classes 

12.8.3 

^PP 

Classes 

12.8.3 

BQP 

Classes 

12.8.3 

NPI 

Classes 

12.8.3 


http://engineeringbooks.net 



Propositions and 
Predicates 


Mathematical logic is the foundation on which the proofs and arguments rest. 
Propositions are statements used in mathematical logic, which are either true 
or false but not both and we can definitely say whether a proposition is true 
or false. 

In this chapter we introduce propositions and logical connectives. Normal 
forms for well-formed formulas are given. Predicates are introduced. Finally, 
we discuss the rules of inference for propositional calculus and predicate calculus. 

1.1 PROPOSITIONS (OR STATEMENTS) 

A proposition (or a statement) is classified as a declarative sentence to which 
only one of the truth values, i.e. true or false, can be assigned. When a 
proposition is true, we say that its truth value is T. When it is false, we say 
that its truth value is F. 

Consider, for example, the following sentences in English: 

1. New Delhi is the capital of India. 

2. The square of 4 is 16. 

3. The square of 5 is 27. 

4. Every college will have a computer by 2010 a.d. 

5. Mathematical logic is a difficult subject. 

6. Chennai is a beautiful city. 

7. Bring me coffee. 

8. No. thank you. 

9. This statement is false. 

The sentences 1-3 are propositions. The sentences 1 and 2 have the truth value 
T. The sentence 3 has the truth value F. Although we cannot know the truth 
value of 4 at present, we definitely know that it is true or false, but not both. 
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So the sentence 4 is a proposition. For the same reason, the sentences 5 and 
6 are propositions. To sentences 7 and 8, we cannot assign truth values as they 
are not declarative sentences. The sentence 9 looks like a proposition. 
However, if we assign the truth value T to sentence 9. then the sentence asserts 
that it is false. If we assign the truth value F to sentence 9, then the sentence 
asserts that it is true. Thus the sentence 9 has either both the truth values (or 
none of the two truth values). Therefore, the sentence 9 is not a proposition. 

We use capital letters to denote propositions. 

1.1.1 Connectives (Propositional Connectives 
or Logical Connectives) 

Just as we form new sentences from the given sentences using words like 
'and’, 'but’, ‘if, we can get new propositions from the given propositions 
using 'connectives’. But a new sentence obtained from the given propositions 
using connectives will be a proposition only when the new sentence has a truth 
value either T or F (but not both). The truth value of the new sentence 
depends on the (logical) connectives used and the truth value of the given 
propositions. 

We now define the following connectives. There are five basic 
connectives. 

(i) Negation (NOT) 

(ii) Conjunction (AND) 

(iii) Disjunction (OR) 

(iv) Implication (IF . . . THEN . t -Pjr'' 

(v) If and Only If. 

Negation (NOT) 

If P is a proposition then the negation P or NOT P (read as 'not P’) is a 
proposition (denoted by —t P) whose truth value is T if P has the truth value 
F, and whose truth value is F if P has the truth value T. Usually, the truth 
values of a proposition defined using a connective are listed in a table called 
the truth table for that connective (Table 1.1). 


TABLE 1.1 

Truth Table for Negation 

P 

nP 

T 

F 

F 

T 


Conjunction (AND) 

If P and Q are two propositions, then the conjunction of P and Q (read as ‘P 
and (F) is a proposition (denoted by P a Q) w'hose truth values are as given 
in Table 1.2. 
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TABLE 1.2 Truth Table for Conjunction 


p 

Q 

P A Q 

- T 

T 

T 

T 

F 

F 

F 

T 

F 

F 

F 

F 


Disjunction (OR) 

If P and Q are two propositions, then the disjunction of P and Q (read as T 
or Q 9 ) is a proposition (denoted by P v Q) whose truth values are as given 
in Table 1.3. 

TABLE 1.3 Truth Table for Disjunction 


P 

Q 

P v Q 

T 

T 

T 

T 

F 

T 

F 

T 

T 

F 

F 

F 


It should be noted that P v Q is true if P is true or Q is true or both are 
true. This OR is known as inclusive OR, i.e. either P is true or Q is true or 
both are true. Here we have defined OR in the inclusive sense. We will define 
another connective called exclusive OR (either P is true or Q is true, but not 
both, i.e. where OR is used in the exclusive sense) in the Exercises at the end 
of this chapter. 


EXAMPLE 1.1 


If P represents 'This book is good’ and Q represents k This book is cheap’, 
write the following sentences in symbolic form: 

(a) This book is good and cheap. 

(b) This book is not good but cheap. 

(c) This book is costly but good. 

(d) This book is neither good nor cheap. 

(e) This book is either good or cheap. 

Solution 

(a) P a Q 

(b) nP A Q 

(c) -i Q a P 

(d) —\ P a -,Q 

(e) P v Q 

Note: The truth tables for P a Q and Q a P coincide. So P a Q and 
Q a P are equivalent (for the definition, see Section 1.1.4). But in 
natural languages this need not happen. For example, the two sentences, 
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namely T went to the railway station and boarded the train’, and 4 I boarded 
the train and went to the railway station', have different meanings. Obviously, 
we cannot write the second sentence in place of the first sentence. 

Implication (IF ... THEN ...) 

If P and Q are two propositions, then 'IF P THEN O' is a proposition 
(denoted by P => Q) whose truth values are given Table 1.4. We also read 
P Q as 4 P implies Q\ 


TABLE 1.4 Truth Table for Implication 


p 

Q 

P => Q 

T 

T 

T 

T 

F 

F 

F 

T 

T 

F 

F 

T 


We can note that P => Q assumes the truth value F only if P has the truth 
value T and Q has the truth value F. In all the other cases, P => Q assumes 
the truth value T. In the case of natural languages, we are concerned about the 
truth values of the sentence 'IF P THEN Q’ only when P is true. When P 
is false, we are not concerned about the truth value of 'IF P THEN Q\ But 
in the case of mathematical logic, we have to definitely specify the truth value 
of P => Q in all cases. So the truth value of P Q is defined as T when 
P has the truth value F (irrespective of the truth value of Q). 


EXAMPLE 1.2 


Find the truth values of the following propositions: 

(a) If 2 is not an integer, then 1/2 is an integer. 

(b) If 2 is an integer, then 1/2 is an integer. 

Solution 

Let P and Q be ‘2 is an integer’, T/2 is an integer*, respectively. Then the 
proposition (a) is true (as P is false and Q is false) and the proposition (b) 
is false (as P is true and Q is false). 

The above example illustrates the following: 'We can prove anything if 
we start with a false assumption.’ We use P => Q whenever we want to 
'translate’ any one of the following: 4 P only if Q\ 'P is a sufficient condition 
for Q\ '0 is a necessary condition for P\ 'Q follows from P\ 4 Q whenever 
P\ Q provided P\ 

If and Only If 

If P and Q are two statements, then 'P if and only if O' is a statement 
(denoted by P <=> Q) whose truth value is T when the truth values of P and 
Q are the same and whose truth valufe is F when the statements differ. The 
truth values of P <=> Q are given in Table 1.5. 
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TABLE 1.5 Truth Table for if and Only If 



P 

Q 

P & Q 


T 

T 

T 


T 

F 

F 


F 

T 

F 


F 

F 

T 

Table 1.6 summarizes the representation and meaning of the five logical 
connectives discussed above. 


TABLE 1.6 

Five Basic Logical 

Connectives 

Connective 


Resulting proposition 

Read as 

Negation -i 


—i P 

Not P 

Conjunction a 


P A Q 

P and Q 

Disjunction v 


P V Q 

P or Q 
(or both) 

Implication => 


P => Q 

If P then Q 
(P implies Q) 

If and oniy if <t=> 

P Q 

P if and only if Q 


EXAMPLE 1.3 


Translate the following sentences into propositional forms: 

(a) If it is not raining and I have the time, then I will go to a movie. 

(b) It is raining and I will not go to a movie., 

(c) It is not raining. 

(d) I will not go to a movie. 

(e) I will go to a movie only if it is not raining. 

Solution 

Let P be the proposition It is raining 5 . 

Let Q be the proposition 1 have the time 5 . 

Let R be the proposition ‘I will go to a movie 5 . 

Then 

(a) (-i P a 0 => R 

(b) P a R 

(c) P 

(d) -n R 

(e) R => -i P 


EXAMPLE 1.4 


If P, Q, R are the propositions as given in Example 1.3. write the sentences 
in English corresponding to the following propositional forms: 
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(a) (-. P a Q) « R 

(b) (Q =* R) a (R => 0 

(c) -i (2 v J?) 

(d) r => p a e 

Solution 

(a) I will go to a movie if and only if it is not raining and I have the 
time. 

(b) I will go to a movie if and only if I have the time. 

(c) It is not the case that I have the time or I will go to a movie. 

(d) I will go to a movie, only if it is not raining or I have the time. 

1.1.2 Well-formed Formulas 

Consider the propositions P a Q and Q a P. The truth tables of these two 
propositions are identical irrespective of any proposition in place of P and any 
proposition in place of Q. So we can develop the concept of a propositional 
variable (corresponding to propositions) and well-formed formulas 
(corresponding to propositions involving connectives). 

Definition 1.1 A propositional variable is a symbol representing any 
proposition. We note that usually a real variable is represented by the symbol 
x. This means that jt is not a real number but can take a real value. Similarly, 
a propositional variable is not a proposition but can be replaced by a 
proposition. 

Usually a mathematical object can be defined in terms of the property/ 
condition satisfied by the mathematical object. Another way of defining a 
mathematical object is by recursion. Initially some objects are declared to 
follow the definition. The process by which more objects can be constructed 
is specified. This way of defining a mathematical object is called a recursive 
definition. This corresponds to a function calling itself in a programming 
language. 

The factorial n\ can be defined as n(n - 1) ... 2.1. The recursive 
definition of n\ is as follows: 

0! = 1, n\ = n(n - 1)! 

Definition 1.2 A well-formed formula (wff) is defined recursively as 
follows: 

(i) If P is a propositional variable, then it is a wff. 

(ii) If a is a wff, then —i a is a wff 

(iii) If a and j3 are well-formed formulas, then (a v /J), (a a /J), (a ft), 
and (a <=> ft) are well-formed formulas. 

(iv) A string of symbols is a wff if and only if it is obtained by a finite 
number of applications of (i)-(iii). 
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Notes: (1) A wff is not a proposition, but if we substitute a proposition in 
place of a propositional variable, we get a proposition. For example: 

(i) —s (P v Q) a Q a R) => Q is a wff. 

(ii) (-i F a Q) « Q is a wff. 

(2) We can drop parentheses when there is no ambiguity. For example, in 
propositions we can remove the outermost parentheses. We can also specify the 
hierarchy of connectives and avoid parentheses. 

For the sake of convenience, we can refer to a wff as a formula. 


1.1.3 Truth Table for a Well-formed Formula 

If we replace the propositional variables in a formula a by propositions, we 
get a proposition involving connectives. The table giving the truth values of 
such a proposition obtained by replacing the propositional variables by 
arbitrary propositions is called the truth table of a. 

If a involves n propositional constants, then we have 2 n possible 
combinations of truth values of propositions replacing the variables. 


EXAMPLE 1.5 


Obtain the truth table for a = (P v Q) a (P Q) a (Q P). 

Solution 

The truth values of the given wff are shown imTable 1.7. 


TABLE 1.7 Truth Table of Example 1.5 


p 

Q 

PvQ 

P => Q 

(P v Q) a (P => Q) 

(Q => P) 

a 

T 

T 

T 

T 

T 

T 

T 

T 

F 

T 

F 

F 

T 

F 

F 

T 

T 

T 

T 

F 

F 

F 

F 

F 

T 

F 

T 

F 


EXAMPLE 1.6 


Construct the truth table for a= (P v Q) =* ((P v R) => (R v Q)). 

Solution 

The truth values of the given formula are shown in Table 1.8. 
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TABLE 

1.8 Truth Table of Example 1.6 



p 

Q 

R 

P V R 

Tj 

< 

o 

(P v R) =* (R v Q) 

(P v Q) 

a 

7 

7 

7 

T 

7 

7 

7 

7 

7 

7 

F 

T 

7 

7 

7 

7 

7 

F 

7 

T 

7 

7 

7 

7 

7 

F 

F 

T 

F 

F 

7 

F 

F 

7 

T 

T 

7 

7 

7 

7 

F 

7 

F 

F 

7 

7 

7 

7 

F 

F 

7 

7 

7 

7 

F 

7 

F 

F 

F 

F 

F 

7 

F 

7 


Some formulas have the truth value T for all possible assignments of truth 
values to the propositional variables. For example, P v P has the truth value 
T irrespective of the truth value of P. Such formulas are called tautologies . 

Definition 1.3 A tautology or a universally true formula is a well-formed 
formula whose truth value is T for all possible assignments of truth values to 
the propositional variables. 

For example. P v —■ P, (P a Q) => P, and ((P => Q) a (Q => R)) => 
(P ==> R) are tautologies. 

A 7 ote: When it is not clear whether a given formula is a tautology, we can 
construct the truth table and verify that the truth value is T for all combinations 
of truth values of the propositional variables appearing in the given formula. 


EXAMPLE 1.7 


Show that a - (P => (O => R)) => ((P => Q) =» (£_j=xf?)) is a tautology. 

Solution 

We give the truth values of a in Table 1.9. 


TABLE 1.9 Truth Table of Example 1.7 


P 

Q 

R 

Q => R 

P =£ (Q R) 

P => Q 

P =$ R 

(P =* Q) => (P => R) 

a 

7 

T 

7 

7 

T 

7 

7 

7 

7 

7 

7 

F 

F 

F 

7 

F 

F 

7 

7 

F 

7 

7 

7 

F 

7 

7 

7 

7 

F 

F 

7 

7 

F 

F 

7 

7 

F 

7 

7 

7 

7 

7 

7 

7 

T 

F 

7 

F 

F 

7 

7 

7 

7 

7 

F 

F 

7 

7 

7 

7 

7 

7 

7 

F 

F 

F 

7 

7 

7 

7 

7 

7 


Definition 1.4 A contradiction (or absurdity) is a wff whose truth value is 
F for all possible assignments of truth values to the propositional variables. 
For example, 

Pa —i P and (P a Q_ ) a —i Q 

are contradictions. 

Note: a is a contradiction if and only if -i a is a tautology. 
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1,1.4 Equivalence of Well-formed Formulas 

Definition 1.5 Two wffs a and /3 in propositional variables P b P 2 , .. P u 
are equivalent (or logically equivalent) if the formula a <=> /? is a tautology. 
When a and j3 are equivalent, we write a h f3. 

Notes: (1) The wffs a and j3 are equivalent if the truth tables for a and /? are 
the same. For example. 

P a Q = O a P and = F 

(2) It is important to note the difference between a <=> [3 and a = j3. 
a <=> P is a formula, whereas a = /3 is not a formula but it denotes the relation 
between of and /?. 


EXAMPLE 1.8 


Show that (P => (0 v P)) = ((P =» 0 v (P => P)). 

Solution 

Let a = (P => (2 v P)) and [3 = ((P => 0 v (P => P)). We construct the 
truth values of a and (3 for all assignments of truth values to the variables P, 
<2 and P. The truth values of a and j3 are given in Table 1.10. 


TABLE 1.10 Truth Table of Example 1.8 


p 

Q 

R 

Q v R 

P => (Q v R) 

P => Q 

P => R 

(, P => Q) v (P => R) 

T 

T 

T 

T 

T 

T 

"T~~ 

T 

T 

T 

F 

T 

T 

T 

F 

T 

T 

F 

T 

T 

T 

F 

T 

T 

T 

F 

F 

F 

F 

F 

F 

F 

F 

T 

T 

T 

T 

T 

T 

T 

F 

T 

F 

T 

T 

T 

T 

T 

F 

F 

T 

T 

T 

T 

T 

T 

F 

F 

F 

F 

T 

T 

T 

T 


As the columns corresponding to a and /3 coincide, a = /?. 

As the truth value of a tautology is T, irrespective of the truth values of 
the propositional variables, we denote any tautology by T. Similarly, we 
denote any contradiction by F. 


1.1.5 Logical Identities 

Some equivalences are useful for deducing other equivalences. We call them 
identities and give a list of such identities in Table 1.11. 

The identities / t —/ 12 can be used to simplify formulas. If a formula /3 is 
part of another formula a, and j3 is equivalent to /T, then we can replace (3 
by /3' in a and the resulting wff is Equivalent to a 
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TABLE 1.11 Logical Identities 


u 

Idempotent laws: 

PvPeP, P a P = P 

h 

Commutative laws: 

P v Q = Q v P, P a G = Q a P 

h 

Associative laws: 

P V (Q V R) = (P V Q) v R, P A (Q A P) = (P A Q) a R 

U 

Distributive laws: 

P v (Q a R) s (P v Q) a (P v R), P a (Q v R) = (P a Q) v (P a R) 

h 

Absorption laws: 

Pv(PaQ)=.P Pa(PvQ)eP 

h 

DeMorgan’s laws: 

—i (P v Q) s —i P a — i Q, —i (P a Q) — — i P v —t Q 

h 

Double negation: 

P^hP) 

4 

PvnP = T, Pa -i P = F 

h 

P v T = T, p A l = p, P v F s P, P A F s F 

4o 

(P =» Q) a (P => -i Q) = -i P 

/ii 

Contrapositive: 

P => Q = —i Q => —i P 

42 

PsQMnPvQ) 


EXAMPLE 1.9 


Show that (P a Q) v (P a -i Q) = P. 

Solution 


L.H.S. = (P a Q) v (P a — i (2) 


= P A (£ V -n 0 

by using the distributive law (i.e. / 4 ) 

III 

> 

H 

by using /§ 

= p 

by using / 9 

= R.H.S. 



EXAMPLE 1.10 


Show that (P => 0 a (/? => Q) s (P 

Solution 

L.H.S. = (P =* Q) a (R => Q) 

= (-.P v 0 a v 0 by 
= (Q v P) a (Q v -i P) by 
= <2 v (— i Pa — i R) by 

S Q V (—I (P v P)) by 

— ( * (P v P)) V (2 by 

= (P v R) =$ Q ,by 

= R.H.S. 


v 7?) 


G 


using 

^12 



using 

the 

commutative 

law (i.e. / 2 ) 

using 

the 

distributive law (i.e. / 4 ) 

using 

the 

DeMorgan’s 

law (i.e. / 6 ) 

using 

the 

commutative 

law (i.e. / 2 ) 

using 

A 2 
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1.2 NORMAL FORMS OF WELL-FORMED FORMULAS 

We have seen various well-formed formulas in terms of two propositional 
variables, say, P and Q. We also know that two such formulas are equivalent 
if and only if they have the same truth table. The number of distinct truth 
tables for formulas in P and Q is 2 4 . (As the possible combinations of truth 
values of P and 0 are TT } TF ' FT, FF, the truth table of any formula in P 
and Q has four row's. So the number of distinct truth tables is 2 4 .) Thus there 
are only 16 distinct (nonequivalent) formulas, and any formula in P and Q is 
equivalent to one of these 16 formulas. 

In this section we give a method of reducing a given formula to an 
equivalent form called the ’normal form'. We also use ‘sum’ for disjunction, 
‘product’ for conjunction, and ‘literal’ either for P or for —i P, where P is any 
propositional variable. 

Definition 1.6 An elementary product is a product of literals. An elementary 
sum is a sum of literals. For example, Pa— \Q, —iP a —iQ, Pa Q, —i P a Q 
are elementary products. And P v — \Q, P v —i R are elementary sums. 

Definition 1.7 A formula is in disjunctive normal form if it is a sum of 
elementary products. For example, P v (Q a R) and P v (-i Q a R) are in 
disjunctive normal form. P a (Q v R) is not in disjunctive normal form. 


1.2.1 Construction to Obtain a Disjunctive Normal 
Form of a Given Formula 

Step 1 Eliminate and using logical identities. (We can use / 12 , he. 

P => Q = (r^P v 0.) 

Step 2 Use DeMorgan’s laws (/ 6 ) to eliminate —i before sums or products. 
The resulting formula has —i only before the propositional variables, i.e. it 
involves sum, product and literals. 

Step 3 Apply distributive laws (/ 4 ) repeatedly to eliminate the product of 
sums. The resulting formula will be a sum of products of literals, i.e. sum of 
elementary' products. 


EXAMPLE 1.11 


Obtain a disjunctive normal form of 

P v (-, /> => {Q v (Q => R))) 

Solution 

p v (-, P => (Q v (Q => R))) 

e p v (n/’ => (2 v (-><2v—i R))) (step 1 using / 12 ) 

= P v (P v (<2 v (-i Q v -! R))) (step 1 using / 12 and / 7 ) 
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= PvPvQv^Qv—\R by using / 3 

= PvQ\z~^Qv~iR by using ij 

Thus, PvQv-iOv—iPisa disjunctive normal form of the given formula. 


EXAMPLE 1.12 


Obtain the disjunctive normal form of 

(P A - (G A /?)) v (P => 0 

Solution 

(P A i (Q A P)) v (P => 0 

= (P a -i (<2 a P) v (—i P v Q) (step 1 using / 12 ) 

s (P a (-i (3 v -i R)) v (-i P v 0 (step 2 using 1 7 ) 

s(Pa-iQ)v(Pa-i/?)v-iPvQ (step 3 using / 4 and / 3 ) 

Therefore, (P a — i 0 v (P a -i P) v — i P v 0 is a disjunctive normal form 
of the given formula. 

For the same formula, we may get different disjunctive normal forms. For 
example, (P a Q a R) v (P a Q a — i R) and P a Q are disjunctive normal 
forms of P a Q. So, we introduce one more normal form, called the principal 
disjunctive normal form or the sum-of-products canonical form in the next 
definition. The advantages of constructing the principal disjunctive normal 
form are: _ 

(i) For a given formula, its principal disjunctive normal form is unique. 

(ii) Two formulas are equivalent if and only if their principal disjunctive 
normal forms coincide. 

Definition 1.8 A minterm in n propositional variables P h . . P n is 
Qi a 0 • • * A Qn> where each Q } is either P i or —> P z . 

For example, the minterms in P] and P 2 are Pj a P 2 , —i Pi a P 2 , 
P| a —i Py —iPj a — i P 2 . The number of minterms in n variables is 2 n . 

Definition 1.9 A formula a is in principal disjunctive normal form if a is 
a sum of minterms. 

1.2.2 Construction to Obtain the Principal 

Disjunctive Normal Form of a Given Formula 

Step 1 Obtain a disjunctive normal form. 

Step 2 Drop the elementary products which are contradictions (such as 
P A P). 

Step 3 If Pi and -i P, are missing in an elementary product a, replace a by 
(a a Pj) v (a a -i Pj). 
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Step 4 Repeat step 3 until all the elementary products are reduced to sum 
of minterms. Use the idempotent laws to avoid repetition of minterms. 


EXAMPLE 1.13 


Obtain the canonical sum-of-products form (i.e. the principal disjunctive 
normal form) of 

a = P v (— Ai?) 

Solution 

Here a is already in disjunctive normal form. There are no contradictions. So 
we have to introduce the missing variables (step 3). -i P a — ■ Q a R in a is 
already a min term. Now, 

P S (P A 0 V (P A -i Q) 

s ((P A Q A P) V (P A Q A P)) V (P A Q A P) V (P A Q A 1 P) 

= ((P A 0 A P) V (P A (2 A -1 R)) V ((P A -1 Q A P) V (P A 0 A i R)) 

Therefore, the canonical sum-of-products form of a is 

(P a Q a R) v OP a Q a R) v (P a Q a /?) 
vCPa-.Ga-i/JJv^Pa-.Ga/?) 


EXAMPLE 1.14 


Obtain the principal disjunctive normal form of 

Ot — (—t P v —i Q) (—i P a R) 

Solution 

GC — (—i P V —i O) (—i P A P) 

= P v -i Q)) v (-n P a P) by using / l2 
= (P a 0 v (— i P a P) by using DeMorgan’s law 

= ((P A Q A P) V (P A 2 A -1 P» V ((-, P A P A Q) V (-, P A P A Q)) 

= (P A 2 A P) V (P A Q A -1 P) V (-, P A e A P) V (-, P A i Q A P) 

So, the principal disjunctive normal form of a is 
(P A Q A P) V (P A Q A —i P) V (—1 P A Q A P) V (—i P A —i Q A P) 

A minterm of the form Qi a Q 2 a • • • a Q n can be represented by 

a { a 2 . . . where = 0 if Q { = -1 P z and a-, = 1 if <2; = P 7 . So the principal 

disjunctive normal form can be represented by a 'sum* of binary strings. For 
example, (P a Q a P) v (P a 0 a —i P) v (—i P a -i Q a P) can be represented 
by 111 v 110 v 001. 

The minterms in the two variables P and Q are 00, 01, 10, and 11. Each 
wff is equivalent to its principal disjunctive normal form. Every principal 
disjunctive normal form corresponds to the minterms in it, and hence to a 
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subset of {00, 01, 10, 11}. As the number of subsets is 2 4 , the number of 
distinct formulas is 16. (Refer to the remarks made at the beginning of this 
section.) 

The truth table and the principal disjunctive normal form of a are closely 
related. Each minterm corresponds to a particular assignment of truth values 
to the variables yielding the truth value T to a. For example, P a Q a —■ R 
corresponds to the assignment of T, T, F to P, Q and R , respectively. So, if 
the truth table of a is given, then the minterms are those which correspond 
to the assignments yielding the truth value T to a 


EXAMPLE 1.15 


For a given formula a , the truth values are given in Table 1.12. Find the 
principal disjunctive normal form. 


TABLE 1.12 Truth Table of Example 1.15 


p 

Q 

R 

a 

T 

T 

T 

T 

T 

T 

F 

F 

T 

F 

T 

F 

T 

F 

F 

T 

F 

T 

T 

T 

F 

T 

F 

F 

F 

F 

T 

F 

F 

F 

F 

T 


Solution 

We have T in the a-column corresponding to the rows 1, 4, 5 and 8. The 
minterm corresponding to the first row is P a Q a R. 

Similarly, the minterms corresponding to rows 4, 5 and 8 are respectively 
Fa —\ Q a R, —{ P a Q a R and —* P a —i Q a —i R. Therefore, the principal 
disjunctive normal form of a is 

(jP a Q a R) v (P a —i Q a —i R) v (—i P a Q a R ) v (—i Pa —i Q a —i R) 

We can form the ‘dual' of the disjunctive normal form which is termed the 
conjunctive normal form. 

Definition 1.10 A formula is in conjunctive normal form if it is a product 
of elementary sums. 

If a is in disjunctive normal form, then —i a is in conjunctive normal 
form. (This can be seen by applying the DeMorgan’s laws.) So to obtain the 
conjunctive normal form of a, we construct the disjunctive normal form of 
—j a and use negation. 

Definition 1.11 A maxterm in n propositional variables P h P 2 , • • P n is 
Q\ v Qi v * ’ ’ v Qw where each Q } is either P f or -i P r 
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Definition 1.12 A formula a is in principal conjunctive normal form if a 
is a product of maxterms. For obtaining the principal conjunctive normal form 
of a, we can construct the principal disjunctive normal form of —i a and apply 
negation. 


EXAMPLE 1.16 


Find the principal conjunctive normal form of a — P v (Q => R). 

Solution 

na=n(Pv(2=^ R)) 

— i (F v ( i Q v R')') by using J 12 

= —i P a (—1 (—1 Q v R )) by using DeMorgan’s law 

s —, P a (Q a -i R) by using DeMorgan’s law and / 7 

—i P a Q a —! R is the principal disjunctive normal form of — i a. Hence, 
the principal conjunctive normal form of a is 

—i (—i P a Q a —s R) — P v —i Q v R 

The logical identities given in Table 1.11 and the normal forms of well-formed 
formulas bear a close resemblance to identities in Boolean algebras and normal 
forms of Boolean functions. Actually, the propositions under v, a and — i form 
a Boolean algebra if the equivalent propositions are identified. T and F act as 
bounds (i.e. 0 and 1 of a Boolean algebra). Also, the statement formulas form 
a Boolean algebra under v, a and -i if the equivalent formulas are identified. 

The normal forms of well-formed formulas correspond to normal forms 
of Boolean functions and we can ’minimize’ a formula in a similar manner. 

1.3 RULES OF INFERENCE FOR PROPOSITIONAL 
CALCULUS (STATEMENT CALCULUS) 

In logical reasoning, a certain number of propositions are assumed to be true, 
and based on that assumption some other propositions are derived (deduced or 
inferred). In this section we give some important rules of logical reasoning or 
rules of inference. The propositions that are assumed to be true are called 
hypotheses or premises. The proposition derived by using the rules of inference 
is called a conclusion. The process of deriving conclusions based on the 
assumption of premises is called a valid argument. So in a valid argument we 
are concerned with the process of arriving at the conclusion rather than 
obtaining the conclusion. 

The rules of inference are simply tautologies in the form of implication 
(i.e. P => 0. For example. P =» (P v Q) is such a tautology, and it is a rule 

P 

of inference. We write this in the form —-——. Here P denotes a premise. 

P v Q 

The proposition below the line. i.e. P v Q is the conclusion. 
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We give in Table 1.13 some of the important rules of inference. Of course, 
we can derive more rules of inference and use them in valid arguments. 

For valid arguments, we can use the rules of inference given in 
Table 1.13. As the logical identities given in Table 1.11 are two-way 
implications, we can also use them as rules of inference. 


TABLE 1.13 Rules of Inference 


Rule of inference 

Implication form 

Rif Addition 

P 

P v Q 

P => (P v Q) 

Rif Conjunction 


P 

Q 

P a Q 

P A Q => P A Q 

Rif Simplification 

PaQ 

P 

(PaQ)=>P 

Rif Modus ponens 

P 

P=> Q 

Q 

(P A (P => Q)) => Q 

Rif. Modus tollens 

Q 

P=> Q 

-iP 

(-i Q a (P => Q)) => -iQ 

P/ 6 : Disjunctive syllogism 


^P 

PvQ 

0 

(-. P a (P v Q)) => Q 

Rif. Hypothetical syllogism 

P =$> Q 

Q => R 

.-. P=> R 

((P => Q) a (Q => P)) => (P => P) 

Rif Constructive dilemma 

(P => Q) A (R => S) 

P v R 

:.QvS 

((P => Q) a (P => S) a (P v P)) => (Q v S) 

Rif. Destructive dilemma 

p => Q) A (R =* S) 

nQv-S 

P v P 

((P => Q) A (P => S) A (— i Q V — 1 S)) =» (-1 P V -I P) 
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EXAMPLE 1.17 


Can we conclude S from the following premises? 

(i ) P => Q 

(ii) P => R 

(iii) -n (Q A P) 

(iv) S v P 


Solution 


The valid argument for deducing S from the given four premises is given as 
a sequence. On the left, the well-formed formulas are given. On the right, we 
indicate whether the proposition is a premise (hypothesis) or a conclusion. If 
it is a conclusion, we indicate the premises and the rules of inference or logical 
identities used for deriving the conclusion. 


L P =* Q 

2. P => R 

3. (P => Q) a CP => R) 

4. -i (Q a R) 

5. -,Q v -ifl 

6. nPVnP 

7. —i P 

8 . 5 v P 

9. 5 


Premise (i) 

Premise (ii) 

Lines 1. 2 and PP 
Premise (iii) 

Line 4 and DeMorgan’s law (Z 6 ) 

Lines 3. 5 and destructive dilemma (P/ 9 ) 
Idempotent law /j 
Premise (iv) 

Lines 7. 8 and disjunctive syllogism P / 6 


Thus, we can conclude 5 from the given premises. 


\ 


EXAMPLE 1.18 


Derive S from the following premises using a valid argument: 

(i) P => O 

(ii) Q => -i/? 

(iii) P v 5 

(iv) P 


Solution 


1 . p => e 

2. 2 => iP 

3. p => -,p 

4. p 

5. —1 (—i p) 

6 . nP 

7. P v 5 

8 . 5 


Premise (i) 

Premise (ii) 

Lines 1, 2 and hypothetical syllogism P / 7 
Premise (iv) 

Line 4 and double negation h 
Lines 3. 5 and modus tollens Rh 
Premise (iii) 

Lines 6 . 7 and disjunctive syllogism P / 6 


Thus, we have derived 5 from the given premises. 
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EXAMPLE 1.19 


Check the validity of the following argument: 

If. Ram has completed B.E. (Computer Science) or MBA, then he is 
assured of a good job. If Ram is assured of a good job, he is happy. Ram is 
not happy. So Ram has not completed MBA. 

Solution 

We can name the propositions in the following way: 

P denotes ‘Ram has completed B.E. (Computer Science)’. 

Q denotes ‘Ram has completed MBA'. 

R denotes ‘Ram is assured of a good job’. 

S denotes ‘Ram is happy’. 

The given premises are: 

(i) (Pv0^i? 

(ii) R => 5 

(iii) -i S 

The conclusion is —i Q. 


1. (P V Q) => R 

Premise (i) 

2. R =*> 5 

Premise (ii) 

3. (P v Q) => S 

Lines 1, 2 and hypothetical syllogism R1 

4. -i S 

Premise (iii) 

5. (P v 0 

Lines 3, 4 and modus tollens RI 5 

6. —i P A -i Q 

DeMorgan’s law 4 

7. —i <2 

Line 6 and simplification R/ 3 


Thus the argument is valid. 


EXAMPLE 1.20 


Test the validity of the following argument: 

If milk is black then every cow is white. If every cow is white then it has 
four legs. If every cow has four legs then every buffalo is white and brisk. 
The milk is black. 

Therefore, the buffalo is white. 

Solution 

We name the propositions in the following way: 

P denotes ‘The milk is black’. 

Q denotes ‘Every 7 cow is white’. 

R denotes ‘Every cow has four legs’. 

S denotes ‘Every buffalo is white’. 

T denotes ‘Every buffalo is brisk’. 
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The given premises are: 

(i) P =* Q 

(ii) Q => R 

(iii) R => S a T 

(iv) P 

The conclusion is S. 

L P 

2. P =* Q 

3. G 

4 . R 

5. R 

6. r => 5 a r 

1. S aT 
8. 5 

Thus the argument is 


Premise (iv) 
Premise (i) 

Modus ponens RU 
Premise (ii) 

Modus ponens RI 4 
Premise (iii) 
Modus ponens RI 4 
Simplification Rh 
valid. 


1.4 PREDICATE CALCULUS 

Consider two propositions 'Ram is a student’, and 'Sam is a student'. As 
propositions, there is no relation between them, but we know they have 
something in common. Both Ram and Sam share the property of being a 
student. We can replace the two propositions by a single statement *x is a 
student’. By replacing a by Ram or Sam (or any other name), we get many 
propositions. The common feature expressed by ‘is a student’ is called a 
predicate . In predicate calculus we deal with sentences involving predicates. 
Statements involving predicates occur in mathematics and programming 
languages. For example. '2x + 3y = 4c\ TF (D. GE. 0.0) GO TO 20’ are 
statements in mathematics and FORTRAN, respectively, involving predicates. 
Some logical deductions are possible only by ‘separating’ the predicates. 


1.4.1 Predicates 

A part of a declarative sentence describing the properties of an object or 
relation among objects is called a predicate. For example, 'is a student’ is a 
predicate. 

Sentences involving predicat^s^describing the property of objects are 
denoted by P(x), where P denotes the predicate and x is a variable denoting 
any object. For example. P(x) can denote x is a student’. In this sentence, x 
is a variable and P denotes the predicate ‘is a student’. 

The sentence A is the father of y* also involves a predicate ‘is the father 
of. Here the predicate describes the relation between two persons. We can 
write this sentence as Fix, v). Similarly, 2x + 3v - 4 z can be described by 
S{x, y, z). ‘ : 
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Note: Although P(x) involving a predicate looks like a proposition, it is not 
a proposition. As P(x) involves a variable x, we cannot assign a truth value 
to P(x). However, if we replace x by an individual object, we get a 
proposition. For example, if we replace x by Ram in P(x), we get the 
proposition ‘Ram is a student'. (We can denote this proposition by P( Ram).) 
If we replace x by f A cat’, then also we get a proposition (whose truth value is 
F). Similarly, 5(2, 0, 1) is the proposition 2*2+3*0=41 (whose truth 
value is T). Also, 5(1, 1, 1) is the proposition 21 + 3*1=41 (whose 
truth value is F). 

The following definition is regarding the possible ‘values’ which can be 
assigned to variables. 

Definition 1.13 For a declarative sentence involving a predicate, the 
universe of discourse, or simply the universe, is the set of all possible values 
which can be assigned to variables. 

For example, the universe of discourse for P(x): ‘x is a student’, can be 
taken as the set of all human names; the universe of discourse for E(n ): ‘n is 
an even integer’, can be taken as the set of all integers (or the set of all real 
numbers). 

Note: In most examples, the universe of discourse is not specified but can be 
easily given. 

Remark We have seen that by giving values to variables, we can get 
propositions from declarative sentences involving predicates. Some sentences 
involving variables can also be assigned truth values. For example, consider 
There exists x such that x 2 = 5’, and ‘For all x, x 2 = (-x) 2 ’. Both these 
sentences can be assigned truth values (T in both cases). ‘There exists’ and 
‘For all’ quantify the variables. 

Universal and Existential Quantifiers 

The phrase ‘for all’ (denoted by V) is called the universal quantifier. Using 
this symbol, we can write ‘For all x, x 2 = (-x) 2 ’ as Vx Q(x), where Q(x) is 
A 2 = (-x) 2 ’. 

The phrase ‘there exists’ (denoted by 3) is called the existential quantifier. 
The sentence ‘There exists x such that x 2 = 5’ can be written as 3xi?(x), 
where R{x) is ‘x 2 = 5’. 

P(x) in Vx P(x) or in 3x P(x) is called the scope of the quantifier V or 3. 

Note: The symbol V can be read as ‘for every’, ‘for any’, ‘for each’, 
‘for arbitrary 7 ’. The symbol 3 can be read as ‘for some’, for ‘at least one’. 

When we use quantifiers, we should specify the universe of discourse. If 
we change the universe of discourse, the. truth value may change. For example, 
consider 3x R(x) y where R(x) is x 2 = 5. If the universe of discourse is the set 
of all integers, then 3x R(x) is false. If the universe of discourse is the set of 
all real numbers, then 3 xR(x) is true (when x = ±J~5 , x 2 = 5). 
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The logical connectives involving predicates can be used for declarative 
sentences involving predicates. The following example illustrates the use of 
connectives. 


EXAMPLE 1.21 


Express the following sentences involving predicates in symbolic form: 

1. All students are clever. 

2. Some students are not successful. 

3. Every clever student is successful. 

4. There are some successful students who are not clever. 

5. Some students are clever and successful. 

Solution 

As quantifiers are involved, we have to specify the universe of discourse. We 
can take the universe of discourse as the set of all students. 

Let C(x) denote hr is clever'. 

Let S(r) denote ; r is successful’. 

Then the sentence 1 can be written as Vr C(r). The sentences 2-5 can be 
written as 

3x S(r)), Vr (C(r) => S(x)), 

3x (S(x) a -i C(r)), 3x (C(x) -i S(r)) 

1.4.2 Well-formed Formulas of Predicate Calculus 

A well-formed formula (wff) of predicate calculus is a string of variables such 
as X\. X 2 , • • - connectives, parentheses and quantifiers defined recursively 
by the following rules: 

(i) P(x 1? ..., x n ) is a wff. where P is a predicate involving n variables 
*i> * 2 > • • •» Ar 

(ii) If a is a wff. then —i a is a wff. 

(iii) If a and /? are wffs. then a v /?, a a /3, a => fi, a <=> f3 are also 
wffs. 

(iv) If or is a wff and x is any vapabTe. then Vr (a), 3x (a) are wffs. 

(v) A string is a wff if and only if it is obtained by a finite number of 
applications of rules (i)—(iv). 

Note: A proposition can be viewed as a sentence involving a predicate with 0 
variables. So the propositions are wffs of predicate calculus by rule (i). 

We call wffs of predicate calculus as predicate formulas for convenience. 
The well-formed formulas introduced in Section 1.1 can be called proposition 
formulas (or statement formulas) to distinguish them from predicate formulas. 
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Definition 1.14 Let a and be two predicate formulas in variables X\, . . 
x n , and let U be a universe of discourse for a and /?. Then a and /? are 
equivalent to each other over U if for every possible assignment of values to 
each variable in a and /3 the resulting statements have the same truth values. 
We can write a - [5 over U. 

We say that a and (5 are equivalent to each other (a = (3) if a = (3 over 
U for every universe of discourse U. 

Remark In predicate formulas the predicate variables may or may not be 
quantified. We can classify the predicate variables in a predicate formula, 
depending on whether they are quantified or not. This leads to the following 
definitions. 

Definition 1.15 If a formula of the form 3 xP(x) or VxP(x) occurs as part 
of a predicate formula oc, then such part is called an x-bound part of a, and 
the occurrence of x is called a bound occurrence of x. An occurrence of x is 
free if it is not a bound occurrence. A predicate variable in a is free if its 
occurrence is free in any part of ol 

In a = (Bx { P{x { . x 2 )) a (Vx 2 Q(x 2 , x 3 )), for example, the occurrence of 
Xj in 3x\ P(x h X 2 ) is a bound occurrence and that of x 2 is free. In Vx 2 Q(x^ x 3 ), 
the occurrence of x 2 is a bound occurrence. The occurrence of x 3 in a is 
free. 

Note: The quantified parts of a predicate formula such as Vx P(x) or 3x P(x) 
are propositions. We can assign values from the universe of discourse only 
to the free variables in a predicate formula ct 

Definition 1.16 A predicate formula is valid if for all possible assignments 
of values from any universe of discourse to free variables, the resulting 
propositions have the truth value T\ 

Definition 1.17 A predicate formula is satisfiable if for some assignment of 
values to predicate variables the resulting proposition has the truth value T. 

Definition 1.18 A predicate formula is unsatisfiable if for all possible 
assignments of values from any universe of discourse to predicate variables the 
resulting propositions have the truth value F. 

We note that valid predicate formulas correspond to tautologies among 
proposition formulas and the unsatisfiable predicate formulas correspond to 
contradictions. 
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1.5 RULES OF INFERENCE FOR PREDICATE 
CALCULUS 

Before discussing the rules of inference, we note that: (i) the proposition 
formulas are also the predicate formulas; (ii) the predicate formulas (where all 
the variables are quantified) are the proposition formulas. Therefore, all the 
rules of inference for the proposition formulas are also applicable to predicate 
calculus wherever necessary. 

For predicate formulas not involving connectives such as A(x), P(x, y), we 
can get equivalences and rules of inference similar to those given in 
Tables 1.11 and 1.13. For Example, corresponding to / 6 in Table 1.11 we get 
(P(x) v Q(x)) = — i (P(a ; )) a -i (0 (a)). Corresponding to i?/ 3 in Table 1.13 
P a 0 => P, we get P(x) a Q(x) => P( x). Thus we can replace propositional 
variables by predicate variables in Tables 1.11 and 1.13. 

Some necessary equivalences involving the two quantifiers and valid 
implications are given in Table 1.14. 


TABLE 114 Equivalences Involving Quantifiers 


/13 

Distributivity of d over v: 

±x(P(x) v Q(x)) = 3xP(x) v 3.v Q(.v) 

3x(P v Q(.r)) = P v (3 x Q(x)) 

I<4 

Distributivity of V over a: 

V.v (P(x) a Q(x)) ^ V.vP(.v) a Va-Q(.v) 

V.v (P a Q(.v)) - P a (V.v Q(x)) 

7l5 

-(3* P(v)) = V.v -i (P{x)) 

/16 

(V.v P(x)) = 3x~.(P(x)) 

A 7 

3.v (P a Q(.v)) mP a (3x Q(.v)) 

As 

V.r (P v Q(.r)) = P v (V.v Q(x)) 

R1 io 

V.rP(.v) 3xP(x) 

Rl ii 

V.rP(.v) v V.v Q(.v) => V.v (P(x) v Q(.v)) 

R/,2 

3x (P(.v) A Q(.v)) => av P(x) A a.v Q(.v) 


Sometimes when we wish to derive someneonclusion from a given set of 
premises involving quantifiers, we may have to eliminate the quantifiers 
before applying the rules of inference for proposition formulas. Also, when 
the conclusion involves quantifiers, we may have to introduce quantifiers. The 
necessary rules of inference for addition and deletion of quantifiers are given 
in Table 1.15. 
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TABLE 1.15 Rules of inference for Addition and 



Deletion of Quantifiers 

^13 : 

Universal instantiation 

V.t P(x) 

■ P(c) 

c is some element of the universe. . 

Rl u : 

Existential instantiation 

—v P(x) 

■■■ P(c) 

c is some element for which P(c) is true. 

«/ 15 : 

Universal generalization 

P(x) 

V xP(.x) 

x should not be free in any of the given premises. 


Existential generalization 

P(c) 

3 xP(x) 

c is some element of the universe. 


EXAMPLE 1.22 


Discuss the validity of the following argument: 

All graduates are educated. 

Ram is a graduate. 

Therefore. Ram is educated. 

Solution 

Let G(x) denote A is a graduate’. 

Let E(x ) denote x is educated’. 

Let R denote 'Ram’. 

So the premises are (i) Vx (G(x) E{x)) and (ii) G(R). The conclusion is E(R). 

Vx (G(x) ==> E(x)) Premise (i) 

G(R) => E(R) Universal instantiation 

G(R) Premise (ii) 

E(R) Modus ponens RI 4 

Thus the conclusion is valid. 
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EXAMPLE 1.23 


Discuss the validity of the following argument: 

All graduates can read and write. 

Ram can read and write. 

Therefore, Ram is a graduate. 

Solution 

Let G(x) denote A is a graduate’. 

Let L( x) denote x can read and write’. 

Let R denote ‘Ram 5 . 

The premises are: Vx (G(x) => L(x)) and L(R). 

The conclusion is G(R). 

(( G(R ) => L(R)) a L(R)) => G(R ) is not a tautology. 

So we cannot derive G(R). For example, a school boy can read and write 
and he is not a graduate. 


EXAMPLE 1.24 


Discuss the validity of the following argument: 

All educated persons are well behaved. 
Ram is educated. 

No well-behaved person is quarrelsome. 
Therefore. Ram is not quarrelsome. 


Solution 


Let the universe of discourse be the set of all educated persons. 
Let P(x) denote *x is well-behaved’. 

Let y denote ‘Ram’. 

Let Q(x) denote \v is quarrelsome’. ^ 

So the premises are: 


(i) VxP(x). 

(ii) y is a particular element of the universe of discourse, 
(hi) Vx (P(x) => CM)- 


To obtain the conclusion, we have the following arguments: 


1. 

2 . 

3. 

4. 

5. 

6 . 


V.Y P(x) 

P(y) 

Vx (P(x) 
P(v) => ■ 

P(y) 

- Q(y) 


GM) 


Premise (i) 

Universal instantiation RI i3 
Premise (iii) 

Universal instantiation /?/ 13 
Line 2 

Modus ponens RI A 


—i Q(y) means that ; Ram is not quarrelsome'. Thus the argument is valid. 
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1.6 SUPPLEMENTARY EXAMPLES 


EXAMPLE 1.25 


Write the following sentences in symbolic form: 

(a) This book is interesting but the exercises are difficult. 

(b) This book is interesting but the subject is difficult. 

(c) This book is not interesting, the exercises are difficult but the subject 
is not difficult. 

(d) If this book is interesting and the exercises are not difficult then the 
subject is not difficult. 

(e) This book is interesting means that the subject is not difficult, and 
conversely. 

(f) The subject is not difficult but this book is interesting and the 
exercises are difficult. 

(g) The subject is not difficult but the exercises are difficult. 

(h) Either the book is interesting or the subject is difficult. 

Solution 

Let P denote ‘This book is interesting*. 

Let Q denote The exercises are difficult 5 . 

Let R denote The subject is difficult’. 

Then: 

(a) P a Q 

(b) P a R 

(c) -1 P A Q A -1 R 

(d) (P a — i Q) => —i R 

(e) P <^> -iR 

(f) (- R) a (P a Q) 

(g) —iR a Q 

(h) nP V R 


EXAMPLE 1.26 


Construct the truth table for a = (—i P —i Q) <=> Q R 

Solution 

The truth table is constructed as shown in Table 1.16. 
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TABLE 1.16 Truth Table of Example 1.26 

P 

Q 

R 

Q <=> R 

nP 

—i Q 

—, P ~s Q 

a 

T 

T ■ 

T 

T 

F 

F 

T 

T 

T 

T 

F 

F 

F 

F 

T 

F 

T 

F 

T 

F 

F 

T 

F 

T 

T 

F 

F 

T 

F 

T 

F 

F 

F 

T 

T 

T 

T 

F 

F 

F 

F 

T 

F 

F 

T 

F 

F 

T 

F 

F 

T 

F 

T 

T 

T 

F 

F 

F 

F 

T 

T 

T 

T 

T 



Prove that: a = ((P => (Q v R )) a (-, 0) => (P => i?) is a tautology. 

Solution 

Let & = OP =* (Q v /?)) a (-, 0 

The truth table is constructed as shown in Table 1.17. From the truth 
table, we conclude that a is a tautology. 


TABLE 1.17 Truth Table of Example 1.27 


P 

Q 

R 

-n Q 

Q v R 

P => (Q v R) 

v 

P => R 

a 

T 

T 

T 

F 

T 

T 

F 

T 

T 

T 

T 

F 

F 

T 

T 

F 

F 

T 

T 

F 

T 

T 

T 

T 

T 

T 

T 

T 

F 

F 

T 

F 

F 

F 

F 

T 

F 

T 

T 

F 

T 

T 

F 

T 

T 

F 

T 

F 

F 

T 

T 

F 

T 

T 

F 

F 

T 

T 

T 

T 

T 

T 

T 

F 

F 

F 

T 

F 

T 

T 

T 

T 


EXAMPLE 1.28 


State the converse, opposite and contrapositive to the following statements: 

(a) If a triangle is isoceles, then two of its sides are equal. 

(b) If there is no unemployment in India, then the Indians won’t go to 
the USA for employment. 

Solution 

If p => Q is a statement, then its converse, opposite and contrapositive 
statements are, Q P, —\Q and -i Q => —i P % respectively. 

(a) Converse—If two of the sides of a triangle are equal, then the triangle 
is isoceles. 
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Opposite—If the triangle is not isoceles, then two of its sides are not 
equal. 

Contrapositive—If two of the sides of a triangle are not equal, then 
the triangle is not isoceles. 

(b) Converse—If the Indians won’t go to the USA for employment, then 
there is no unemployment in India. 

Opposite—If there is unemployment in India, then the Indians will go 
to the USA for employment. 

(c) Contrapositive—If the Indians go to the USA for employment, then 
there is unemployment in India. 


EXAMPLE 1.29 


Show that: 

(-1 P a (-i Q a R)) v ( Q a R) v (P a R) R 

Solution 


(—i P a (— i Q a R)) v (Q A R) V (P A R) 

<=> ((-i P a -i Q) a R) v (Q a R) v (P a R) by using the associative law 

<=> (-1 (P v Q) a R) v (Q a R) v (P a R) by using the DeMorgan’s law' 

<=> (—t (P v Q) a R) v (Q v P) a R) by using the distributive law 

(-i (P v Q) v (P v Q) a R by using the commutative 

and distributive laws 

<=> T a i? by using / g 

<=> R by using / 9 



Using identities, prove that: 

Q v {P a —i Q) v (—11 P a — 1 0 is a tautology 


Solution 

Q v (P a 0 v (-, P a 0 
<=> ((C v P) A (Q v —, 2)) V ( I P A 
<=> ((0 v P) A T) V (-, P A -i 0) 
o (Q V />) V (P v 2) 

<=> CP v Q) v —i ( P v Q) 

T 

Hence the given formula is a tautology. 


0 by using the distributive law 
by using / 8 

by using the DeMorgan’s law' 
and L) 

by using the commutative 
law 

by using / 8 
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EXAMPLE 1.31 


Test the validity of the following argument: 

If I get the notes and study well, then I will get first class. 
I didn't get first class. 

So either I didn’t get the notes or I didn’t study well. 

Solution 

Let P denote 'I get the notes'. 

Let Q denote T study well’. 

Let R denote T will get first class.’ 

Let S denote T didn’t get first class.’ 

The given premises are: 

(i) P a Q => R 

(ii) R 


The conclusion is — i P v — i Q. 


1. P a Q => R 

Premise (i) 

2 . -iR 

Premise (ii) 

3. — i (P a Q) 

Lines 1, 2 and modus tollens. 

4. -iP v 

DeMorgan's law 

Thus the argument is valid. 



EXAMPLE 1.32 


Explain (a) the conditional proof rule and (b) the indirect proof. 

Solution 

(a) If we want to prove A B, then we take A as a premise and construct 
a proof of B. This is called the conditional proof rule. It is denoted 
by CP. 

(b) To prove a formula a, we construct a proof of —i a => F. In 
particular, to prove A => B. we construct a proof of A a —i B => F. 



Test the validity of the following argument: 

Babies are illogical. 

Nobody is despised wdio can manage a crocodile. 
Illogical persons are despised. 

Therefore babies cannot manage crocodiles. 
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Solution 

Let 8(a) denote ‘x is a baby’. 

Let 7(a) denote 'a is illogical'. 

Let D(x) denote ‘x is despised’. 

Let C(x) denote ‘x can manage crocodiles’. 

Then the premises are: 


(i) Va (8(a) => 7(a)) 

(ii) Va (C(x) =* D(x)) 

(iii) Va (7(a) => D( a)) 

The conclusion is Va (8(a) => 

1. Va (8(a) => 7(a)) 

2. Va (C(a) => 77(a)) 

3. Va (7(a) => D(a)) 

4. 8(a) => 7(a) 

5. C(a) => D{x) 

6. 7(a) => D(x) 

7. 8(a) 

8. 7(a) 

9. D(a) 

10. —i C(a) 

11. 8(a) => C( a) 

12. Va (8(a) => C( a)) 
Hence the conclusion is valid. 


EXAMPLE 1.34 


Give an indirect proof of 


-H C(x)). 

Premise (i) 

Premise (ii) 

Premise (iii) 

1, Universal instantiation 

2, Universal instantiation 

3, Universal instantiation 
Premise of conclusion 

4.7 Modus pollens 

6.8 Modus pollens 

5.9 Modus tollens 

7.10 Conditional proof 

11, Universal generalization, 


P ^ Q. P v S) ^ S 


Solution 

We have to prove S. So we include (iv) —i S as a premise. 


1. P V S 

Premise (iii) 

2 . -tS 

Premise (iv) 

3. P 

1,2, Disjunctive syllogism 

4. P => Q 

Premise (ii) 

5. Q 

3,4, Modus ponens 

6. —i C 

Premise (i) 

7. Q a —i Q 

5,6, Conjuction 

8 . F 

4 

We get a contradiction. Hence (-i Q* P => Q* P v 
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EXAMPLE 1.35 


Test the validity of the following argument: 

All integers are irrational numbers. 

Some integers are powers of 2. 

Therefore, some irrational number is a power of 2. 

Solution 

Let Z(x) denote is an integer'. 

Let I(x) denote A is an irrational number’. 

Let P(x) denote ‘x is a power of 2\ 

The premises are: 

(i) V* (Z(x) => /(*)) 

(ii) 3x (Z(x) a P(x)) 

The conclusion is 3x (I(x) a P(x)). 


1. 3a (Z(.X) A P(x)) 

Premise (ii) 

2. Z(fc) A P(Z?) 

1, Existential instantiation 

3. Zib) 

2, Simplification 

4. P(Z>) 

2, Simplification 

5. V* (Z(x) => /(.v)) 

Premise (i) 

6. Z(2>) => /(Z>) 

5. Universal instantiation 

7. /(fc) 

3,6, Modus ponens 

8. /(&) a P(Z>) 

7,4 Conjunction 

9. 3a (/(a) a P(a)) 

8, Existential instantiation. 


Hence the argument is valid. 

SELF-TEST 

Choose the correct answer to Questions 1-5: 

1. The following sentence is not a proposition. 

(a) George Bush is the President of India. 

(b) y/Zi is a real number. 

(c) Mathematics is a difficult subject. 

(d) I wish you all the best. 

2. The following is a well-formed formula. 

(a) (P a Q) => (P v Q) 

(b) (P a Q) => (P v O) a R) 

(c) (P a (Q a R)) => (P a Q )) 

(d) i (Q a i (P v i Q) 


http://engineeringbooks.net 



32 B Theory of Computer Science 


~ P a Q . u , 

3 . -~ is called: 

P 

(a) Addition 

(b) Conjunction 

(c) Simplification 

(d) Modus tollens 

4. Modus ponens is 

(a) —i Q 

P^Q 
-i P 

(b) -n P 
PvQ 

• -Q 

(c) P 

P Q 

M2 

(d) none of the above 

5. nP a a i? is a minterm of: 

(a) P v Q 

(b) —i P a -i Q a R 

(c) P A Q A R A S 

(d) P A R 

6 . Find the truth value of P => Q if the truth values of P and Q are F and 
T respectively. 

7. For what truth values of P. 2 and P, the truth value of (P =$> Q) => R 
is F? 

(P, 2> P have the truth values F, F, F or F, F, F) 

8. If P. P have the truth values F. F, F, respectively, find the truth 
value of (P => 2) v (P => P)* 

9. State universal generalization. 

10. State existential instantiation. 

EXERCISES 

1.1 Which of the following sentences are propositions? 

(a) A triangle has three sides. 

(b) 11111 is a prime number. 

(c) Every dog is an animal. 
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(d) Ram ran home. 

(e) An even number is a prime number. 

(f) 10 is a root of the equation jc - 1002x + 10000 = 0 

(g) Go home and take rest. 

1.2 Express the following sentence in symbolic form: For any two numbers 
a and b , only one of the following holds: ct < b, ci = h, and a > b. 

1.3 The truth table of a connective called Exclusive OR (denoted by v) is 
shown in Table 1.18. 


TABLE 1.18 Truth Table for Exclusive OR 


p 

Q 

P v Q 

T 

T 

F 

T 

c 

T 

F 

T 

T 

F 

F 

F 


Give an example of a sentence in English (i) in which Exclusive OR 
is used, (ii) in which OR is used. Show that v is associative, 
commutative and distributive over a. 

1.4 Find tw ? o connectives, using which any other connective can be 
described. 

1.5 The connective NAND denoted by T (also called the Sheffer stroke) is 
defined as follows: P T Q - -n (P a Q). Show that every connective 
can be expressed in terms of NAND. 

1.6 The connective NOR denoted by -l (also called the Peirce arrow) is 
defined as follows: P si Q = —i (P v Q). Show 7 that every connective 
can be expressed in terms of NOR. 

1.7 Construct the truth table for the following: 

(a) (P (OP v R) => (R v Q)) 

(b) (P v (Q => R)) (CP v —i R) => Q) 

1.8 Prove the following equivalences: 

ta) (—i P (—i P {—i P a Q))) — P v Q 

(b) P = (P v 0 a (P v —\ Q) 

(C) “i (P <=> Q) = (P a —i 0 v (-i P a Q) 

1.9 Prove the logical identities given in Table 1.11 using truth tables. 

1.10 Show that P => (Q (R (-i P (—i Q —, R))) is a tautology. 

1.11 Is (P => —i P) => P (i) a tautology, (ii) a contradiction, (iii) neither 
a tautology nor a contradiction? 
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1.12 Is the implication (P a (P => —i g)) v(g=>-ig)=>-i2 a tautology? 

1.13 Obtain the principal disjunctive normal form of the following: 

(a) p => (P => g a (-, (-, g v P))) 

(b) (g a R a 5) v (R a 5). 

1.14 Simplify the formula whose principal disjunctive normal form is 
110 v 100 v 010 v 000. 

1.15 Test the validity of the following arguments: 

(a) P => g 

R => -ig 

P => -i R 

(b) R => G 
p => <2 
-,/?=> S 

P=> s 

(c) P 

o 

-iQ =$ R 
g => -iP 
P 

(d) P => G A P 
g v s => r 

Sv P 
T 

1.16 Test the validity of the following argument: 

If Ram is clever then Prem is well-behaved. 

If Joe is good then Sam is bad and Prem is not well-behaved. 

If Lai is educated then Joe is good or Ram is clever. 

Hence if Lai is educated and Prem is not well-behaved then Sam is bad. 

1.17 A company called for applications from candidates, and stipulated the 
following conditions: 

(a) The applicant should be a graduate. 

(b) If he knows Java he should know C++. 

(c) If he knows Visual Basic he should know Java. 

(d) The applicant should know Visual Basic. 

Can you simplify the above conditions? 

1.18 For what universe of discourse the proposition V.v(x > 5) is true? 
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1.19 By constructing a suitable universe of discourse, show that 

3x (PC*) => Q(x)) <=> (3x P(x) => 3x Q(x)) 
is not valid. 

1.20 Show that the following argument is valid: 

All men are mortal. 

Socrates is a man. 

So Socrates is mortal. 

1.21 Is the following sentence true? If philosophers are not money-minded 
and some money-minded persons are not clever, then there are some 
persons who are neither philosphers nor clever. 

1.22 Test the validity of the following argument: 

No person except the uneducated are proud of their wealth. 

Some persons who are proud of their wealth do not help others. 
Therefore, some uneducated persons cannot help others. 
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Mathematical 

Preliminaries 


In this chapter we introduce the concepts of set theory and graph theory. Also, 
we define strings and discuss the properties of strings and operations on strings. 
In the final section we deal with the principle of induction, which will be used 
for proving many theorems throughout the book. 


2.1 SETS, RELATIONS AND FUNCTIONS 

2.1.1 Sets and Subsets 

A set is a well-defined collection of objects, for example, the set of all students 
in a college. Similarly, the collection of all books in a college library is also a 
set. The individual objects are called members or elements of the set. 

We use the capital letters A, R, C. . .. for denoting sets. The small letters 
a, b, c, . . . are used to denote the elements of any set. When a is an element 
of the set A, we write a e A. When a is not an element of A, we write a £ A. 

Various Ways of Describing a Set 

(i) By listing its elements . We write all the elements of the set (without 
repetition) and enclose them within braces. We can write the elements 
in any order. For example, the set of all positive integers divisible by 
15 and less than 100 can be written as {15. 30. 45, 60, 75, 90}. 

(ii) By describing the properties of the elements of the set. For example, the 
set {15, 30. 45. 60. 75, 90} can be described as: {n [ n is a positive 
integer divisible by 15 and less than 100}. (The description of the 
property is called predicate . In this case the set is said to be implicitly 
specified.) 


36 
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(iii) By recursion. We define the elements of the set by a computational 
rule for calculating the elements. For example, the set of all natural 
numbers leaving a remainder 1 when divided by 3 can be described as 

{a n | a 0 = 1, a n+x = a„ + 3} 

When the computational rule is clear from the context, we simply specify 
the set by some initial elements. The previous set can be written as {1, 4, 7, 
10, . . .}. The four elements given suggest that the computational rule is: 
3. 

Subsets and Operations on Sets 

A set A is said to be a subset of B (written as A c B) if every element of 
A is also an element of B. 

Two sets A and B are equal (we write A-B) if their members are the same. 
In practice, to prove that A = B, we prove AcB and B c A. 

A set with no element is called an empty set, also called a null set or a void 
set. and is denoted by 0 . 

We define some operations on sets. 

A u B = {x | x e A or x e B }, called the union of A and B. 

A n B = {x | x e A and x e B}, called the intersection of A and B. 

A - B = {x | x e A and x £ B }. called the complement of B in A. 

A c denotes U - A, where U is the universal set, the set of all elements 

under consideration. 

The set of all subsets of a set A is called the power set of A. It is denoted 
by 2 a . 

Let A and B be two sets. Then A x B is defined as {( a , b)\a e A and 
b e B). ((a, b) is called an ordered pair and is different from (b, a).) 

Definition 2.1 Let 5 be a set. A collection (A h A 2 ,.. A n ) of subsets of S is 

n 

called a partition if A, n A; = 0 (i ^ j) and S = U A, (i.e. A\ u A 2 U ... 

i =1 

u A„). 

For example, if S = {1, 2. 3, . .10}, then {{1. 3, 5, 7. 9}, {2, 4, 6, 
8. 10}} is a partition of 5. 

2.1.2 Sets with One Binary Operation 

A binary operation * on a set S is a rule which assigns, to every ordered pair 
(a, b) of elements from 5. a unique element denoted by a * b. 

Addition, for example, is a binary 7 operation on the set Z of all integers. 
(throughout this book. Z denotes the set of all integers.) 

Union is a binary operation on 2 A , where A is any nonempty set. We give 
below five postulates on binary operations. 

Postulate 1: Closure. If a and b are in S, then a * b is in S. 
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Postulate 2: Associativity. If a , b, c are in S , then (a * ft) * c = a * (fc * c). 
Postulate 3: Identity- element There exists a unique element (called the 
identity element) e in S such that for any element x in 5, 

x*e = e*x = x. 

Postulate 4: Inverse. For every element x in S there exists a unique element x 
in S such that x * x' = x' * x = e. The element x' is called the 
inverse of „r w.r.t. *. 

Postulate 5: Commutativity. If a, b e S , then a * b = b * a. 

It may be noted that a binary operation may satisfy none of the above five 
postulates. For example, let S = {1, 2, 3, 4, ...}, and let the binary operation 
be subtraction (i.e. a * b = a - b). The closure postulate is not satisfied since 
2 - 3 = -1 £ S. Also, (2 - 3) - 4 ^ 2 - (3 - 4), and so associativity is not 
satisfied. As we cannot find a positive integer such that x~e-e-x-x, the 
postulates 3 and 4 are not satisfied. Obviously, a - b * b - a. Therefore, 
commutativity is not satisfied. 

Our interest lies in sets with a binary operation satisfying the postulates. 

Definitions (i) A set S with a binary operation * is called a semigroup if the 
postulates 1 and 2 are satisfied. 

(ii) A set S with a binary 7 operation * is called a monoid if the postulates 
1-3 are satisfied. 

(iii) A set S with * is called a group if the postulates 1-4 are satisfied. 

(iv) A semigroup (monoid or group) is called a commutative or an abelian 
semigroup (monoid or group) if the postulate 5 is satisfied. 

Figure 2.1 gives the relationship between semigroups, monoids, groups, 
etc. where the numbers refer to the postulate number. 



Fig. 2.1 Sets with one binary operation. 


We interpret Fig. 2.1 as follows: A monoid satisfying postulate 4 is a group. 
A group satisfying postulate 5 is an abelian group, etc. 
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We give below a few examples of sets with one binary operation: 

(i) Z with addition is an abelian group. 

(ii) Z with multiplication is an abelian monoid. (It is not a group since it 
does not satisfy the postulate 4.) 

(iii) {1, 2, 3. . . .} with addition is a commutative semigroup but not a 
monoid. (The identity element can be only 0, but 0 is not in the set.) 

(iv) The power set 2 A of A(A * 0) with union is a commutative monoid. 
(The identity element is 0.) 

(v) The set of all 2 x 2 matrices under multiplication is a monoid but not 
an abelian monoid. 

2 , 1.3 Sets with Two Binary Operations 

Sometimes we come across sets with two binary operations defined on them 
(for example, in the case of numbers we have addition and multiplication). Let 
5 be a set with two binary operations * and o. We give below 11 postulates 
in the following way: 

(i) Postulates 1-5 refer to * postulates. 

(ii) Postulates 6. 7. 8. 10 are simply the postulates L 2. 3, 5 for the binary 
operation o. 

(iii) Postulate 9: If S under * satisfies the postulates 1-5 then for every x 
in S . with .v ^ e, there exists a unique element x' in S such that x o x - 
x o x = e\ where e is the identity element corresponding to o. 

(iv) Postulate 11: Distributivity . For a . b, c. in S 

a. o (b * c) = (a o b) * ( a o c) 

A set with one or more binary operations is called an algebraic system. 
For example, groups, monoids, semigroups are algebraic systems with one 
binary operation. 

We now 7 define some algebraic systems with two binary operations. 

Definitions (i) A set with tw 7 o binary operations * and o is called a ring if 
(a) it is an abelian group w.r.t. *. and (b) o satisfies the closure, associativity 
and distributivity postulates (i.e. postulates 6, 7 and 11). 

(ii) A ring is called a commutative ring if the commutativity postulate is 
satisfied for o. 

(iii) A commutative ring with unity is a commutative ring that satisfies the 
identity postulate (i.e. postulate 8) for o. 

(iv) A field is a set with two binary operations * and o if it satisfies the 
postulates 1-11. 

We now 7 give below 7 a few examples of sets with two binary operations: 

(i) Z with addition and multiplication (in place of * and o) is a 
commutative ring w 7 ith identity. (The identity element w'.r.t. addition is 
0, and the identity element w.r.t. multiplication is 1.) 
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(ii) The set of all rational numbers (i.e. fractions which are of the form 
alb . where a is any integer and b is an integer different from zero) 
is a field. (The identity element w.r.t. multiplication is 1. The inverse 
of alb, alb * 0 is b/a.) 

(iii) The set of all 2 x 2 matrices with matrix addition and matrix 
multiplication is a ring with identity, but not a field. 

(iv) The power set 2 4 (A & 0) is also a set with two binary operations 
u and n. The postulates satisfied by u and n are 1, 2, 3, 5, 6, 7, 8, 
10 and 11. Tire power set 2- 4 is not a group or a ring or a field. But 
it is an abelian monoid w.r.t. both the operations u and n. 

Figure 2.2 illustrates the relation between the various algebraic systems we 
have introduced. The interpretation is as given in Fig. 2.1. The numbers refer 
to postulates. For example, an abelian group satisfying the postulates 6, 7 and 
11 is a ring. 



Fig. 2.2 Sets with two binary operations. 


2.1.4 Relations 

The concept of a relation is a basic concept in computer science as well as in 
real life. This concept arises when w^e consider a pair of objects and compare 
one with the other. For example, ‘being the father of’ gives a relation between 
two persons. We can express the relation by ordered pairs (for instance, 4 a is 
the father of V can be represented by the ordered pair {a, b)). 

While executing a program, comparisons are made, and based on the result, 
different tasks are performed. Thus in computer science the concept of relation 
arises just as in the case of data structures. 

Definition 2.2 A relation R in a set S is a collection of ordered pairs of 
elements in S (i.e. a subset of S x 5). When ( x , y ) is in R , we write xRy . When 
( x , v) is not in R . we write xR ' y . ■ 
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SiSESSS! 

A relation R in Z can be defined by xRy if x > y. 

Properties of Relations 

(i) A relation R in S is reflexive if xRx for even 7 x in 5. 

(ii) A relation R in 5 is symmetric if for x, y in S, yRx whenever xRy. 

(iii) A relation R in S is transitive if for v, y and - in 5. xRz whenever xRy 
and yRz. 

We note that the relation given in Example 2.1 is neither reflexive nor 
symmetric, but transitive. 


EXAMPLE 2.2 


A relation R in {1, 2, 3. 4. 5. 6} is given by 

{(1. 2). (2. 3). (3. 4), (4. 4). (4, 5)} 

This relation is not reflexive as li?'l. It is not symmetric as 2R3 but 3R'2. It 
is also not transitive as 1R2 and 2R3 but li?'3. 


EXAMPLE 2.3 


Let us define a relation /? in {1. 2. .... 10} by aRb if a divides b. R is 
reflexive and transitive but not symmetric (3R6 but 6i?'3). 


EXAMPLE 2.4 


If z, j, n are integers we say that / is congruent to j modulo n (written as 
i = j modulo n or i =j mod n) if i -j is divisible by zz. The ‘congruence modulo 
n is a relation which is reflexive and symmetric (if i - j is divisible by n, so 
is / - /)• If i = j mod n and j = k mod n , then we have i - j = an for some a 
and j - k - bn for some b. So. 

= k = an + bn 

which means that i = k mod n . Thus this relation is also transitive. 

Definition 2.3 A relation R in a set S is called an equivalence relation if it is 
reflexive, symmetric and transitive. 

Example 2.5 gives an equivalence relation in Z. 
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EXAMPLE 2.5 


We can define an equivalence relation R on any set 5 by defining aRb if 
a ~ b. (Obviously, a - a for every a. So, R is reflexive. If a - b then b = a. 
So i? is symmetric. Also, if a - b and b = c, then a = c. So i? is transitive.) 


EXAMPLE 2.6 


Define a relation i? on the set of all persons in New Delhi by aRb if the persons 
a and b have the same date of birth. Then R is an equivalence relation. 

Let us study this example more carefully. Corresponding to any day of the 
year (say, 4th February), we can associate the set of all persons bom on that 
day. In this way the set of all persons in New Delhi can be partitioned into 366 
subsets. In each of the 366 subsets, any two elements are related. This leads to 
one more property of equivalence relations. 

Definition 2.4 Let R be an equivalence relation on a set S. Let a e 5. Then 
C a is defined as 

{b e S | aRb} 

The C a is called an equivalence class containing a. In general, the^Q/s are 
called equivalence classes. 


EXAMPLE 2.7 


For the congruence modulo 3 relation on {1, 2, .... 7}, 

C 2 = {2, 5}, Ci = {L 4, 7}, C 3 = {3, 6} 

For the equivalence relation ‘having the same birth day’ (discussed in Example 
2.6), the set of persons bom on 4th February is an equivalence class, and the 
number of equivalence classes is 366. Also, we may note that the union of all 

the 366 equivalence classes is the set of all persons in Delhi. This is true for 

any equivalence relation because of the following theorem. 

Theorem 2.1 Any equivalence relation R on a set S partitions S into disjoint 
equivalence classes. 

Proof Let (J C a denote the union of distinct equivalence classes. We have to 
prove that: aeS 

(i) S= U C a , 

ae5 

(ii) C a n C b = 0 if C a and C h are different, i.e. C a * C b . 

Let s e S. Then 5 e C s (since sRs , R being reflexive). But C s c U C a . 

aeS 

So S c U C a . By definition of C a9 C a c S for every a in S. So U C fl c S. 

ae5 aeS 

Thus we have proved (i). 

Before proving (ii), we may-note the following: 


Q - Cb 


if aRb 


( 2 . 1 ) 
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As aRb, we have bRa because R is symmetric. Let d e C a . By definition of 
C a , we have aRd. As bRa and aRd. by transitivity of R, we get bRd. This 
means d <= C b . Thus we have proved C a c Q. In a similar way we can show 
that C b c Q. Therefore. (2.1) is proved. 

Now we prove (ii) by the method of contradiction (refer to Section 2.5). 
We want to prove that C a n C b = 0 if C a ^ C /r Suppose Q n Q# 0. Then 
there exists some element d in S such that d e C a and d e Q,. As d e C fl , 
we have aRd . Similarly, we have fcftr/. By symmetry of ft, dRb. As aftd 
and dRb, by transitivity of ft, we have aRb. Now we can use (2.1) to 
conclude that C a = C h . But this is a contradiction (as Q ^ Q). Therefore, 
C a r\ C b = 0. Thus (ii) is proved. I 

If we apply Theorem 2.1 to the equivalence relation congruence modulo 
3 on {1. 2. 3, 4, 5, 6. 7}, we get 

c x = C 4 = C 7 = {1. 4, 7} 

C : = C 5 = {2, 5} 

C 3 = Q = {3. 6} 

and therefore, 

{1. 2, . . ., 7} = Cj u C 2 u C 3 

EXERCISE Let S denote the set of all students in a particular college. 
Define aRb if a and b study in the same class. What are the equivalence 
classes? In what way does R partition 5? 

2.1.5 Closure of Relations 

A given relation R may not be reflexive or transitive. By adding more ordered 
pairs to R we can make it reflexive or transitive. For example, consider a 
relation R - {(L 2), (2, 3), (1, 1). (2, 2)} in {1, 2, 3}. R is not reflexive as 3iC3. 
But by adding (3, 3) to R, we get a reflexive relation. Also, R is not transitive 
as 1R2 and 2R3 but IR'3. By adding the pair (1, 3), we get a relation 
T = {(1. 2), (2, 3), (I. 1), (2, 2), (1. 3)} which is transitive. There are many 
transitive relations T containing R. But the smallest among them is interesting. 

Definition 2.5 Let R be a relation in a set S . Then the transitive closure of R 
(denoted by R *) is the smallest transitive relation containing R. 

Note: We can define reflexive closure and symmetric closure in a similar way. 

Definition 2.6 Let R be a relation in S. Then the reflexive-transitive 
closure of R (denoted by R*) is the smallest reflexive and transitive relation 
containing R. 

For constructing R + and ft* w ? e define the composite of two relations. 
Let R { and R 2 be the two relations in S. Then, 

(i) ^oi?] = {(n. c) € 5 x S | aR x b and bR 2 c for some b e 5} 

(ii) ftf = fti o ftj 

(iii) ft" = ft" -1 o R x for all n > 2 
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Note: For getting the elements of R { o R 2 , we combine {a, b) in R x and 
( b, c) in R 2 to get (a, c) in R { o R 2 . 

Theorem 2.2 Let S be a finite set and R be a relation in S. Then the 
transitive closure R + of R exists and R + - R u R 2 u R 3 . . . . 


EXAMPLE 2.8 


Let R = {(1. 2). (2. 3), (2, 4)} be a relation in {1, 2, 3, 4}. Find R + . 

Solution 

R = {(1, 2), (2, 3), (2, 4)} 

R 2 = {(1, 2), (2, 3), (2, 4)} o {(1, 2), (2, 3), (2, 4)} 

= {CL 3), (1, 4)} 

(We combine (a, b) and ( b , c) in R to get ( a , c) in R 2 .) 

R 2 = R 2 o R = {(1, 3). (1, 4)} o {(1, 2), (2, 3), (2, 4)} = 0 

(Here no pair (a, b) in Rr can be combined with any pair in R2 )—_. 

R 4 = R 5 = • • • = 0 

R + = r u R 2 = {(1, 2), (2, 3), (2. 4), (1, 3), (1, 4)} 


EXAMPLE 2.9 


Let R = {(a. b), ( b, c), (c, a)}. Find R + . 

Solution 

R= {(a. b), ( b, c), ( c, a)} 

R o R = {(a. b), (b, c), (c, a)} o {{a, b), ( b, c), (c, a)} 

= {(a. c), ( b, a), ( c , b)} 

(This is obtained by combining the pairs: (a, b) and (b, c), (b, c) and (c, a), 

and (c, a) and (a, b).) 

= R 2 o R = {{a, c ), (b, a), (c, b)\ o {(a, b), ( b, c), ( c, a)} 

= {(a, a), (b, b), ( c , c)} 

R 4 = R' o R = {{a, a), (b, b), (c, c)} o {{a, b), (b, c), ( c, a)} 

= {(a, b), (b, c), (c, a)} = R 
So, 

R 5 = R 4 o R = R o R = R 2 , R 6 = R 5 o R = R 2 o R = R 3 
R 7 =R 6 oR = R 2 oR = R 4 = R 
Then any R n is one of R, R 2 or R 3 . Hence, 

R + = R u R 2 u R 3 

= {{a, b), (b, c), (c, a), .(a, c), (b, a), (c y b ), (a, a), (b } b), (c, c)} 
Note: R* — R + u {(a, a) J a € 5}. 
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EXAMPLE 2.10 


If R = {(a, b), (b, c), (c , ci)} is a relation in {a, b , c}, find R*. 

Solution 

From Example 2.9, 

R* = R + u {(<a , a), (fe, £>), (c, c)} 

= {(a, &), (b, c), (c, a), (a, c), (£>, a), (c, fe), (a, a), (fc, fe), (c, c)} 


EXAMPLE 2.11 


What is the symmetric closure of relation R in a set 5? 

Solution 

Symmetric closure of R = R u {(£>, a) | 


2.1.6 Functions 

The concept of a function arises when we want to associate a unique value (or 
result) with a given argument (or input). 

Definition 2.7 A function or map / from a set A to a set Y is a rule which 
associates to every' element x in A a unique element in Y y which is denoted by 
f(x). The element/(.v) is called the image of x under/. The function is denoted 
by f: X Y. 

Functions can be defined either (i) by giving the images of all elements 
of X, or (ii) by a computational rule which computes fix) once x is given. 

EXAMPLES (a)/: (1. 2. 3. 4} -> {a, b y c} can be defined by/(I) = a y 
/(2) = c, f( 3) = a y fi 4) = b. 

(b) f'.R-^R can be defined by fix) = xr + 2x + 1 for every a: in R. 
(R denotes the set of all real numbers.) 

Definition 2.8 / : X —> Y is said to be one-to-one (or injective) if different 
elements in X have different images, i.e. f(x { ) ^ f(x 2 ) when x\ & x 2 . 

Note: To prove that / is one-to-one, we prove the following: Assume 
fixi) = f(x 2 ) and show that jq = x 2 - 

Definition 2.9 /: X —¥ Y is onto (suqective) if every element y in Y is the 
image of some element „v in X. 

Definition 2.10 / : X —> Y is said to be a one-to-one correspondence or 
bijection if/is both one-to-one and onto. 
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EXAMPLE 2.12 


/ : Z —> Z given by f(n) - 2n is one-to-one but not onto. 

Solution 

Suppose/(^i) = /(wi). Then 2n r = 2n 2 . So n\ - ju- Hence/is one-to-one. It 
is not onto since no odd integer can be the image of any element in Z (as any 
image is even). 

The following theorem distinguishes a finite set from an infinite set. 

Theorem 2.3 Let 5 be a finite set. Then /: S -> S is one-to-one iff it is onto. 

Note: The above result is not true for infinite sets as Example 2.12 gives a 
one-to-one function / : Z -» Z which is not onto. 


EXAMPLE 2.13 


Show that / : R R - {1} given by f(x) = (x + 1 )i(x - 1) is onto. 

Solution 

Let y e R. Suppose y = fix) = (x + l)/(x - 1). Then y(x - 1) = x + 1, i.e. 
yx - x = 1 + y. So. x = (1 + y)/(y - 1). As (1 + y)/(y - 1) e R for all y * 1. 
y is the image of (1 + y)Ky - 1) in R - {1}. Thus,/is onto. 

The Pigeonhole Principle + 

Suppose a postman distributes 51 letters in 50 mailboxes (pigeonholes). Then 
it is evident that some mailbox will contain at least two letters. This is 
enunciated as a mathematical principle called the pigeonhole principle. 

If n objects are distributed over m places and n > m, then some place 
receives at least two objects. 


EXAMPLE 2.14 


If we select 11 natural numbers between 1 to 380, show that there exist at least 
two among these 11 numbers whose difference is at most 38. 

Solution 

Arrange the numbers 1, 2, 3, . . 380 in 10 boxes, the first box containing 

1, 2. 3, . . .. 38. the second containing 39, 40, . . 76, etc. There are 11 

numbers to be selected. Take these numbers from the boxes. By the pigeonhole 
principle, at least one box will contain two of these eleven numbers. These two 
numbers differ by 38 or less. 


~ The pigeonhole principle is also called the Dirichlet drawer principle, named 
after the French mathematician G. Lejeune Dirichlet (1805-1859). 
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2.2 GRAPHS AND TREES 

The theory of graphs is widely applied in many areas of computer science— 
formal languages, compiler writing, artificial intelligence (AT), to mention only 
a few. Also, the problems in computer science can be phrased as problems in 
graphs. Our interest lies mainly in trees (special types of graphs) and their 
properties. 

2.2.1 Graphs 

Definition 2.11 A graph (or undirected graph) consists of (i) a nonempty set 
V 7 Culled the set of vertices, (ii) a set E called the set of edges, and (iii) a map 
O which assigns to ever} 7 edge a unique unordered pair of vertices. 

Representation of a Graph 

Usually a graph, namely the undirected graph, is represented by a diagram 
where the vertices are represented by points or small circles, and the edges by 
arcs joining the vertices of the associated pair (given by the map <£>). 

Figure 2.3. for example, gives an undirected graph. Thus, the unordered 
pair {vj, v 2 } is associated with the edge e { : the pair (v 2 , v 2 ) is associated with 
e 6 . (e 6 is a self-loop. In general, an edge is called a self-loop if the vertices in 
its associated pair coincide.) 


e 6 


e 5 


Fig. 2.3 An undirected graph. 

Definition 2.12 A directed graph (or digraph) consists of (i) a nonempty set 
V called the set of vertices, (ii) a set E called the set of edges, and (iii) a map 
O which assigns to every edge a unique ordered pair of vertices. 

Representation of a Digraph 

The representation is as in the case of undirected graphs except that the edges 
i,:c represented by directed arcs. 

Figure 2.4. for example, gives a directed graph. The ordered pairs (v 2t v 3 ), 
(v 3 , v_A (vi, v 3 ) are associated with the edges e% ee 2 , respectively. 
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Definitions (i) If (v h v ; ) is associated with an edge e, then v; and v ; * are called 
the end vertices of e; v z is called a predecessor of V; which is a successor of v r 

In Fig. 2.3, vs and v 3 are the end vertices of e 2 . In Fig. 2.4, v 2 is a 
predecessor of v 3 which is a successor of v 2 . Also, v 4 is a predecessor of v 2 and 
successor of v 3 . 

(ii) If G is a digraph, the undirected graph corresponding to G is the 
undirected graph obtained by considering the edges and vertices of G, but 
ignoring the ‘direction' of the edges. For example, the undirected graph 
corresponding to the digraph given in Fig. 2.4 is shown in Fig. 2.5. 



Definition 2.13 The degree of a vertex in a graph (directed or undirected) is 
the number of edges with v as an end vertex. (A self-loop is counted twice while 
calculating the degree.) In Fig. 2.3, deg^) = 2, deg(v 3 ) = 3, deg(T’ 2 ) = 5. In 
Fig. 2.4, deg(v 2 ) = 3, deg(v 4 ) = 2. 

We now mention the following theorem without proof. 

Theorem 2.4 The number of vertices of odd degree in any graph (directed or 
undirected) is even. 

Definition 2.14 A path in a graph (undirected or directed) is an alternating 
sequence of vertices and edges of the form v x e\v 2 e 2 . . . v n _ { e n _iv w beginning 
and ending with vertices such that e r - has and v/+i as its end vertices and 
no edge or vertex is repeated in the sequence. The path is said to be a path 
from \>i to v„. 

For example. v { e 2 v 2 exv 2 is a path in Fig. 2.3. It is a path from to v 2 . In 
Fig. 2.4. v l e 2 v 2 e 2 v 2 is a path from .vj to v 2 . vi^v 2 is also a path from v\ to v 2 . 
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And v 2 e 4 v 4 e 5 v 2 is a path from v 3 to v 2 . We call v 3 e 4 v 4 e 5 v 2 a directed path since 
the edges e 4 and e 5 have the forward direction. (But v jt e 2 v 3 e 3 v 2 is not a directed 
path as e 2 is in the forward direction and e 3 is in the backward direction.) 

Definition 2.15 A graph (directed or undirected) is connected if there is a 
path between every pair of vertices. 

The graphs given by Figs. 2.3 and 2.4. for example, are connected. 

Definition 2.16 A circuit in a graph is an alternating sequence v l eiv 2 e 2 . . . 
e n -\V\ of vertices and edges starting and ending in the same vertex such that 
e i has Vj and v /+1 as the end vertices and no edge or vertex other than v t is 
repeated. 

In Fig. 2.3. for example. v 3 e 3 v 2 e 5 v 4 e 4 v 3 , \'ie 2 v 3 e 4 v 4 e 5 v 2 eivi are circuits. In 
Fig. 2.4. v\e 2 v 3 e 3 v 2 e\V\ and v 2 e 3 v 3 e 4 v 4 e 5 v 2 are circuits. 

2.2.2 Trees 

Definition 2.17 A graph (directed or undirected) is called a tree if it is 
connected and has no circuits. 

The graphs given in Figs. 2.6 and 2.7. for example, are trees. The graphs 
given in Figs. 2.3 and 2.4 are not trees. 

Note: A directed graph G is a tree iff the corresponding undirected graph 
is a tree. 



Fig. 2.6 A tree with four vertices. 



We now discuss some properties of trees (both directed and undirected) 
used in developing transition systems and studying grammar rules. 
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Property 1 A tree is a connected graph with no circuits or loops. 

Property 2 In a tree there is one and only one path between every pair of 
vertices. 

Property 3 If in a graph there is a unique (i.e. one and only one) path 
between every pair of vertices, then the graph is a tree. 

Property 4 A tree with n vertices has n - 1 edges. 

Property 5 If a connected graph with n vertices has n - 1 edges, then it is 
a tree. 

Property 6 If a graph with no circuits has n vertices and n - I edges, then 
it is a tree. 

A leaf in a tree can be defined as a vertex of degree one. The vertices 
other than leaves are called internal vertices. 

In Fig. 2.6. for example. v h v ? , v 4 are leaves and v 2 is an internal vertex. 
In Fig. 2.7. v 2 , v 5 , v- are leaves and vj, v 3 , v 4 are internal vertices. 

The following definition of ordered trees will be used for representing 
derivations in context-free grammars. 

Definition 2.18 An ordered directed tree is a digraph satisfying the following 
conditions: 

7]: There is one vertex called the root of the tree which is distinguished 
from all the other vertices and the root has no predecessors. 

T< There is a directed path from the root to every other vertex. 
r 3 : Every 7 vertex except the root has exactly one predecessor. 

7 4 : The successors of each vertex are ordered 'from the left’. 

Note: The condition 7 4 of the definition becomes evident once we have the 
diagram of the graph. 

Figure 2.7 is an ordered tree with vj as the root. Figure 2.8 also gives an 
ordered directed tree with v A as the root. In this figure the successors of are 
ordered as v 2 v 3 . The successors of v 3 are ordered as v 5 v 6 . 



Fig. 2.8 An ordered directed tree. 
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By adopting the following convention, we can simplify Fig, 2.8. The root 
is at the top. The directed edges are represented by arrows pointing downwards. 
As all the arrows point downwards, the directed edges can be simply 
represented by lines sloping downwards, as illustrated in Fig. 2.9. 



Fig. 2.9 Representation of an ordered directed tree. 

Note: An ordered directed tree is connected (which follows from 77). It has 
no circuits (because of T 3 ). Hence an ordered directed tree is a tree (see 
Definition 2.17). 

As we use only the ordered directed trees in applications to grammars, we 
refer to ordered directed trees as simply trees. 

Definition 2.19 A binary 7 tree is a tree in which the degree of the root is 2 and 
the remaining vertices are of degree 1 or 3. 

Note: In a binary tree any vertex has at most two successors. For example, the 
trees given by Figs, 2.11 and 2.12 are binary trees. The tree given by Fig. 2.9 
is not a binary tree. 

Theorem 2.5 The number of vertices in a binary tree is odd. 

Proof Let n be the number of vertices. The root is of degree 2 and the 
remaining n~ 1 vertices are of odd degree (by Definition 2.19). By 
Theorem 2.4. n - 1 is even and hence n is odd. I 

We now introduce some more terminology regarding trees: 

(i) A son of a vertex v is a successor of v. 

(ii) The father of v is the predecessor of v. 

(iii) If there is a directed path from vj to vj is called an ancestor of 
and vs is called a descendant of Vj. ( Convention: v t is an ancestor of 
itself and also a descendant of itself.) 

(iv) The number of edges in a path is called the length of the path. 

(v) The height of a tree is the length of a longest path from the root. For 
example, for the tree given by Fig. 2.9, the height is 2. (Actually there 
are three longest paths, v\ is —> v* v\ —» v 3 —> v 5 , v { —» vs —> v 6 . 
Each is of length 2.) 

(vi) A vertex v in a tree is at level k if there is a path of length k from the 
root to the vertex v (the maximum possible level in a tree is the height 
of the tree). 
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Figure 2.10, for example, gives a tree where the levels of vertices are 
indicated. 


Root 



Fig. 2.10 illustration of levels of vertices. 



For a binary tree T with n vertices, show that the minimum possible height 
is riog : (/7 + 1) - 11 where f £1 is the smallest integer > h and the maximum 
possible height is (n - l)/2. 

Solution 


In a binary tree the root is at level 0. As every vertex can have at most two 
successors, we have at most two vertices at level 1. at most 4 vertices at level 
2, etc. So the maximum number of vertices in a binary tree of height k is 
1 + 2 + 2" + ■ ♦ • + 2\ As T has n vertices, I + 2 + 2~ + • * * + 2 k > n, i.e. 
(2* +1 - l)/(2 -1) > ju so k > log 2 (n + 1) - 1. As k is an integer, the smallest 
possible value for k is F log 2 (n + 1) - 11 Thus the minimum possible height 
is f log 2 (« + 1) - 11 

To get the maximum possible height, we proceed in a similar way. In 
a binary tree we have the root at zero level and at least two vertices at level 
1.2,.... When T is of height k, we have at least 1 + 2 + ■ • • + 2 (2 repeated 
k times) vertices. So, 1 + 2£ < n, i.e. k < (n - l)/2. But, n is odd by 
Theorem 2.4. So in - l)/2 is an integer. Hence the maximum possible value 
for k is (?? - l)/2. 


EXAMPLE 2.16 


When n - 9. the trees with minimum and maximum height are shown 
in Figs, 2.11 and 2.12 respectively. The height of the tree in Fig. 2.11 is 
Flog2(9 + 1) - 1] = 3. For the tree in Fig. 2.12. the height = (9 - l)/2 = 4. 
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EXAMPLE 2.17 


Prove that the number of leaves in a binary tree T is (/? + l)/2, where n is 
the number of vertices. 

Solution 

Let m be the number of leaves in a tree with n vertices. The root is of degree 
2 and the remaining n - m - 1 vertices are of degree 3. As T has n vertices, 
it has /? - 1 edges (by Property 4). As each edge is counted twice while 
calculating the degrees of its end vertices. 2 (n - 1) = the sum of degrees of all 
vertices = 2 + 777 + 3(n - m - 1). Solving for m , we get m = (n + l)/2. 


EXAMPLE 2.18 


For the tree shown in Fig. 2.13, answer the following questions: 
(a) Which vertices are leaves and which internal vertices? 
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(b) Which vertices are the sons of 5? 

(c) Which vertex is the father of 5? 

(d) What is the length of the path from 1 to 9? 

(e) What is the left-right order of leaves? 

(f) What is the height of the tree? 



Fig. 2.13 The directed tree for Example 2.18, 

Solutions 

(a) 10, 4, 9, 8, 6 are leaves. 1, 2. 3, 5, 7 are internal vertices. 

(b) 7 and 8 are the sons of 5. 

(c) 3 is the father of 5. 

(d) Four (the path is 1 —» 3 —» 5 -» 7 9). 

(e) 10-4-9-8-6. 

(f) Four (1 —> 3 —» 5 -» 7 —> 9 is the longest path). 

2.3 STRINGS AND THEIR PROPERTIES 

A string over an alphabet set X is a finite sequence of symbols from X. 

NOTATION: X* denotes the set of all strings (including A, the empty string) 
over the alphabet set X. That is, X + = X* - {A}. 

2.3.1 Operations on Strings 

The basic operation for strings is the binary concatenation operation. We 
define this operation as follows: Let x and y be two strings in X*. Let us form 
a new string z by placing y after x, i.e. z = xv. The string z is said to be 
obtained by concatenation of x and y. 
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EXAMPLE 2.19 


Find xv and yx, where 

(a) x = 010, y = 1 

(b) x = aA, y = ALGOL 

Solution 

(a) xy = 0101, yjc = 1010. 

(b) xy — a a .ALGOL 
yx - ALGOL a A. 

We give below 7 some basic properties of concatenation. 

Property 1 Concatenation on a set X* is associative since for each x, y, z in 
X* x(yz) = (pcy)z. 

Property 2 Identity ? element. The set X* has an identity element A w.r.t. 
the binary operation of concatenation as 

xA = Ax = x for every x in X* 

Property 7 3 X* has left and right cancellations. For x, v, z in X* 
zk = zy implies x = y (left cancellation) 
xz = yz implies x = y (right cancellation) 

Property 4 For x, y in X* we have 

I •*>’ | = | * | + \ y\ 

where | x | , | y |, | xy j denote the lengths of the strings x, y, xy, respectively. 
We introduce below 7 some more operations on strings. 

Transpose Operation 

We extend the concatenation operation to define the transpose operation as 
follows: 

For any x in X* and a in X, 

(xaf = a(x) T 

For example, (, aaabab) 7 is babaaa. 

Palindrome. A palindrome is a string which is the same whether written 
forward or backward, e.g. Malayalam. A palindrome of even length can be 
obtained by concatenation of a string and its transpose. 

Prefix and suffix of a string. A prefix of a string is a substring of leading 
symbols of that string. For example, w is a prefix of y if there exists y in X* 
such that y = wy . Tnen we write w < y. For example, the string 123 has four 
prefixes, i.e. A, 1, 12, 123. 

Similarly, a suffix of a string is a substring of trailing symbols of that 
string, i.e. w is a suffix of y if there exists / e X* such that y = yw. For 
example, the string 123 has four suffixes, i.e. A, 3, 23, 123. 



56 a Theory of Computer Science 


Theorem 2.6 (Levi’s theorem) Let v, w, x and y e Z* and vw = xy. Then: 

(i) there exists a unique string z in Z* such that v = xz and y = zw if 

_. M > l*|; 

(ii) v = x, y = w % i.e. z = A if |v| = |x|; \ 

(iii) there exists a unique string z in E* such that x = vz, and w = zy if 

| V j < |x|. 

Proof We shall give a very simple proof by representing the strings by a 
diagram (see Fig. 2.14). I 







% x * * y *■ 

Case 1: \v\ > |x| v-xz y- zw 




* x * * y ' 

Case 2: \v\-\x\ v-x w-y 






* X * * y 


Case 3: \v\ < \x\ x -vz w- zy 


Fig. 2.14 Illustration of Levi’s theorem. 


2.3.2 Terminal and Nonterminal Symbols 

The definitions in this section will be used in subsequent chapters. 

A terminal symbol is a unique indivisible object used in the generation of 
strings. 

A nonterminal symbol is a unique object but divisible, used in the 
generation of strings. A nonterminal symbol will be constructed from the 
terminal symbols: the number of terminal symbols in a nonterminal symbol 
may vary; it is also called a variable. In a natural language, e.g. English, the 
letters a, b, A, B, etc. are terminals and the words boy, cat, dog, go are 
nonterminal symbols. In programming languages, A, B, C, . . Z, :, =, begin, 
and. if, then, etc. are terminal symbols. 

The following will be a variable in Pascal: 

< For statement > » for < control variable > : = 

< for list > do < statement > 
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2.4 PRINCIPLE OF INDUCTION 

The process of reasoning from general observations to specific truths is called 
induction . 

The following properties apply to the set N of natural numbers'and the 
principle of induction. \ 

Property 1 Zero is a natural number. 

Property 2 The successor of any natural number is also a natural number. 
Property 3 Zero is not the successor of any natural number. 

Property 4 No two natural numbers have the same successor. 

Property 5 Let a property P(n) be defined for every natural number n. If 
(i) P( 0) is true, and (ii) /^successor of n) is true whenever P(n) is true, then 
P(n) is true for all n. 

A proof by complete enumeration of all possible combinations is called 
perfect induction . e.g. proof by truth table . 

The method of proof by induction can be used to prove a property P(n) for 
all n. 

2.4.1 Method of Proof by Induction 

This method consists of three basic steps: 

Step 1 Prove P(n) for n = 0/1. This is called the proof for the basis. 

Step 2 Assume the result/properties for Pin). This is called the induction 
hypothesis. 

Step 3 Prove Pin +1) using the induction hypothesis. 


EXAMPLE 2.20 


Prove that l + 3 + 5 + -** + r = /r, for all n > 0, where r is an odd integer 
and n is the number of terms in the sum. (Note: r = 2 n - 1.) 

Solution 

(a) Proof for the basis. For n = I. L.H.S. = 1 and R.H.S. = l 2 = 1. Hence 
the result is true for n - 1. 

(b) By induction hypothesis, we have l + 3 + 5 + -*- + r = rr. As r - 2 n - 1, 

L.H.S. = 1 + 3 + 5 + ••• + (2 n - 1) = ir 

(c) We have to prove that l + 3+ 5 + -- - + r+ r + 2 = (n + l} 2 : 

L.H.S. = (I + 3 + 5 4- + r + (r + 2)) 

= /!' + r t 2 = n~ + In - 1 + 2 = (n + 1)~ = R.H.S. 
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EXAMPLE 2.21 


Prove the following theorem by induction: 

1 + 2 + 3 + ••• + /?.= n(n + l)/2 

Solution 

(a) Proof for the basis. For n = 1, L.H.S. = 1 and 
R.H.S. = 1(1 + l)/2 = 1 

(b) Assume l + 2 + 3+ -* + n== n(n + l)/2. 

(c) We have to prove: 

1 + 2 + 3 + ••• + (/? + 1) = (n + 1 ){n + 2)/2 
1 + 2 + 3 + ••* + « + (n + 1) 

= n(n + l)/2 + (n + 1) (by induction hypothesis) 

= (n + l)(n + 2)12 (on simplification) 

The proof by induction can be modified as explained in the following 
section. 

2.4.2 Modified Method of Induction 

Three steps are involved in the modified proof by induction. 

Step 1 Proof for the basis (n = 0/1). 

Step 2 Assume the result/properties for all positive integers < n + 1. 

Step 3 Prove the result/properties using the induction hypothesis (i.e. step 2), 
for n + 1. 

Example 2.22 below illustrates the modified method of induction. The 
method we shall apply will be clear once w'e mention the induction hypothesis. 


EXAMPLE 2.22 


Prove the following theorem by induction: A tree with n vertices has (n - 1) 
edges. 

Solution 

For n = 1, 2, the following trees can be drawn (see Fig. 2.15). So the theorem 
is true for n = 1, 2. Thus, there is basis for induction. 

9 

O 

O 

n = 1 n = 2 

Fig. 2.15 Trees with one or two vertices. 
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Consider a tree T with (n + 1) vertices as shown in Fig, 2.16. Let e be 
an edge connecting the vertices v f - and Vj. There is a unique path between v } 
and vj through the edge e. (Property of a tree: There is a unique path between 
every pair of vertices in a tree.) Thus, the deletion of e from the graph will 
divide the graph into two subtrees. Let n ] and n 2 be the number of vertices 
in the subtrees. As 77 1 < n and n 2 ^ n, by induction hypothesis, the total 
number of edges in the subtrees is - 1 + n 2 - 1, i.e. n - 2. So, the number 
of edges in T is n - 2 + 1 = n - 1 (by including the deleted edge e ). By 
induction, the result is true for all trees. 



Fig. 2.16 Tree T with (n + 1) vertices. 


EXAMPLE 2.23 


Two definitions of palindromes are given below 7 . Prove by induction that the 
two definitions are equivalent. 

Definition 1 A palindrome is a string that reads the same forward and 
backward. 

Definition 2 (i) A is a palindrome. 

(ii) If a is any symbol, the string a is a palindrome. 

(iii) If a is any symbol and x is a palindrome, then axa is a palindrome. 

(iv) Nothing is a palindrome unless it follows from (i)~(iii). 

Solution 

Let x be a string which satisfies the Definition 1, i.e. x reads the same forward 
and backward. By induction on the length of x we prove that satisfies 
the Definition 2. 

If | x | < 1. then x = a or A. Since x is a palindrome by Definition 1, A 

and a are also palindromes (hence (i) and (ii)), i.e. there is basis for induction. 

If | x | > 1, then v = awci, where w. by Definition 1, is a palindrome; hence the 
rule (iii). Thus, if satisfies the Definition L then it satisfies the Definition 2. 

Let a* be a string which is constructed using the Definition 2. We 
show by induction on | x | that it satisfies the Definition 1. There is basis 
for induction by rule (ii). Assume the result for all strings with length < n. 
Let x be a string of length n. As x has to be constructed using the 

rule (iii). x = ay a, where y is a palindrome. As y is a palindrome by 

Definition 2 and j v | < n, it satisfies the Definition 1. So, x = ay a also satisfies 
the Definition 1. 
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EXAMPLE 2.24 


Prove the pigeonhole principle. 

Proof We prove the theorem by induction on m. if m - 1 and n > h then 
all these n items must be placed in a single place. Hence the theorem is true 
for m = 1. 

Assume the theorem for m . Consider the case of m + 1 places. We prove 
the theorem for n - m + 2. (If n > m + 2. already one of the m + 1 places will 
receive at least two objects from m + 2 objects, by what we are going to prove.) 
Consider a particular place, say, P. 

Three cases arise: 

(i) P contains at least two objects. 

(ii) P contains one object 

(iii) P contains no object. 

In case (i), the theorem is proved for n = m + 2. Consider case (ii). As P 
contains one object, the remaining m places should receive m + 1 objects. By 
induction hypothesis, at least one place (not the same as P) contains at least two 
objects. In case (iii), m + 2 objects are distributed among m places. Once again, 
by induction hypothesis, one place (other than P) receives at least two objects. 
Hence, in all the cases, the theorem is true for (m + I) places. By the principle 
of induction, the theorem is true for all m. 

2.4.3 Simultaneous Induction 

Sometimes we may have a pair of related identities. To prove these, we may 
apply tv/o induction proofs simultaneously. Example 2.25 illustrates this 
method. 


EXAMPLE 2.25 


A sequence F 0 , fj, F 2 . . . . called the sequence of Fibonacci numbers (named 
after the Italian mathematician Leonardo Fibonacci) is defined recursively as 
follows: 


F „ + 1 = 

F„ t* ;?- J* 

F 0 = 0. F { = 1 


P„ 

: Fn + Fix 

= Fn- 1 

(2.2) 

Qn 

: F n +\F n + 

F „F n _ i = F 2i , 

(2.3) 


Proof We prove the two identities (2.2) and (2.3) simultaneously by 
simultaneous induction. P> and Q { are Fj 2 + F 0 2 = F l and F 2 F { + F { F () = F 2 
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respectively. As F 0 = 0. F { = 1, F 2 =1. these are true. Hence there is basis 
for induction. Assume P n and Q n . So 


Fn + Fix = F 2n -1 

F n +\F n + F n F„_ i = 

Now, 

Fix + F; = (F„-i + F ,,) 2 + F; 


= Fix 

+ Ft + 

2F„_, 

F„ + Ft 

= (F,l 

i + Ft) 

+ F lt _ 

l F n + F~ + F n _ ] F n 

1! 

I l ° 

+ Ft + 

(F „-1 

+ F„)F„ + F n F„-x 

= (Fl 

i + F~) 

+ F n+ 

\F„ + 

r ^ 

II 

l + f 2i , 

(by (2.3)) 


~ F ln +1 

This proves P„ +1 . 

Also, 

^-2^i + F^F, = (F w+1 + F„)F„ +1 + (F, + F^F, 
= F 2 +1 + F n+1 F n + F„F /? _! + F; 

= (*£+1 + F ;; 2 ) + ( F n+\ F n + F n F n-\) 


( 2 . 2 ) 

(2.3) 


= F 2 „ +1 + F 2n (By P n+] and (2.3)) 

= F 2n+2 

This proves Q n + j. 

So, by induction (2.2) and (2.3) are true for all n . 

We conclude this chapter with the method of proof by contradiction. 


2.5 PROOF BY CONTRADICTION 

Suppose we want to prove a property P under certain conditions. The method 
of proof by contradiction is as follows: 

Assume that property P is not true. By logical reasoning get a conclusion 
which is either absurd or contradicts the given conditions. 

The following example illustrates the use of proof by contradiction and 
proof by induction. 


EXAMPLE 2.26 


Prove that there is no string x in {a such that ax = xb. (For the definition 
of strings, refer to Section 2.3.) 



62 S Theory of Computer Science 


Proof We prove the result by induction on the length of x. When \x | = L 
x - a or x = b. In both cases ax J=- xb. So there is basis for induction. Assume 
the result for any string whose length is less than n. Let x be any string of length 
n. We prove that ax ^ xb through proof by contradiction. Suppose ax = xb. As 
a is the first symbol on the L.H.S., the first symbol of jc is a . As b is the last 
symbol on R.H.S.. the last symbol of x is h. So, we can write x as ayb with 
j y | = n - 2. This means aayb - aybb which implies ay = yb. This contradicts 
the induction hypothesis. Thus, ax & xb . By induction the result is tme for all 
strings. 


2.6 SUPPLEMENTARY EXAMPLES 


EXAMPLE 2.27 


In a survey of 600 people, it was found that: 

250 read the Week 
260 read the Reader's Digest 
260 read the Frontline 
90 read both Week and Frontline 
110 read both Week and Reader's Digest 
80 read both Reader's Digest and Frontline 
30 read all the three magazines. 

(a) Find the number of people who read at least one of the three 
magazines. 

(b) Find the number of people who read none of these magazines. 

(c) Find the number of people who read exactly one magazine. 

Solution 

Let W, R, F denote the set of people who read Week , Reader's Digest 
and Frontline , respectively. We use the Venn diagram to represent these sets (see 
Fig. 2.17). 



Fig. 2.17 Venn diagram for Example 2.27. 
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It is given that: 

j W | = 250, \R\ = 260, | FI = 260, |WnF|=9Q, 

| W n.F| = 110, |F n F| = 80, |Wni?nF| = 30 
| W u R u F| 

= |W| + |F| + |F| - |Wn F| - |Wn R \ - |F n F| + | W n R r» F| 
= 250 + 260 + 260 - 90 - 110 - 80 + 30 = 520 
So the solution for (a) is 520. 

(b) The number of people who read none of the magazines 

= 600 - 520 = 80 

Using the data, we fill up the various regions of the Venn diagram. 

(c) The number of people who read only one magazine 

= 80 + 100 + 120 = 300 


EXAMPLE 2.28 


Prove that (A u B u Q c = (A u Bf n (A u C) c 

Solution 

(A u B u C) = (A u A) u B u C 

= A u (A u B) u C = (A u B) u (A u C) 
Hence (A u B u C) f = (A u 5) r n (A u C) r (by DeMorgan’s law) 


EXAMPLE 2.29 


Define ciRb if b = for some positive integer k\ a, b e z- Show that R is 
a partial ordering. (A relation is a partial ordering if it is reflexive, 
antisymmetric and transitive.) 

Solution 

As a = a\ we have aRa . To prove that R is antisymmetric, we have to prove 
that ciRb and bRa => a - b. As aRb , we have Z? = a k . As we have a = 
Hence a~b ] = (a A ) / = a A/ . This is possible only when one of the following holds 
good: 

(i) a = 1 

(ii) a = -1 

(iii) kl = 1 

In case (i), b = a k = 1. So a = 2?. 

In case (ii). a = -1 and so kl is odd. This implies that both k and l are 
odd. So 

b = a k = (-1)* = -I = a 

In case (iii), kl = 1. As /; and / are positive integers, k = l - 1. So 

h - a k - a . 
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If aRb and bRc\ then b = a k and c - h l for some k , l. Therefore, c = a Kl . 
Hence ciRc. 


EXAMPLE 2.30 


Suppose A = {1, 2, . . 9} and - relation on A x A is defined by (m, t?) ~ 

(p, #) if rn + q = n + p, then prove that ~ is an equivalence relation. 

Solution 

(m, ~ (w, ft) since m + n = n + m. So - is reflexive. If (m, n) ~ ip , r/), 
then m + q = w + p; thus p + n = q + m. Hence (p, <7) - (m, ft). So ~ is 
symmetric. 

If (m, ft) ~ (p, r/) and (p, </) ~ (r, s). then 

m + q = ft 4- p and p + 5 = r/ + r 

Adding these, 

m + ^+ p + 5 = n+ p + qr + r 

That is. 

m + s-n + r 

which proves (in, n) ~ (/*, s ). 

Hence - is an equivalence relation. 


EXAMPLE 2.31 


If /: A —> B and g : B C are one-to-one, prove that g o/ is one-to-one. 

Solution 

Let us assume that go/^t) = g of(a 2 ). Then, g(f(a\)) = g(/(# 2 ))* As g is 
one-to-one,/(a i) = f(a 2 ). As/is one-to-one, a] = a 2 . Hence go/is one-to- 
one. 


EXAMPLE 2.32 


Show that a connected graph G with 72 vertices and n - 1 edges (n > 3) has 
at least one leaf. 

Solution 

G has n vertices and n - 1 edges. Every edge is counted twice while computing 
the degree of each of its end vertices. Hence 

S deg(v) = 2 (tz - 1) 

where summation is taken over all vertices of G. 

So, I deg(v) is the sum of n positive integers. If deg(v) > 2 for every 
vertex v of G, then 

2 n < Z deg(v) = 2 (n - 1) 

which is not possible. 

Hence deg(v) = 1 for at least one vertex v of G and this vertex v is a leaf. 
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EXAMPLE 2.33 


Prove Property 5 stated in Section 2.2.2. 

Solution 

We prove the result by induction on ru Obviously, there is basis for induction. 
Assume the result for connected graphs with n - 1 vertices. Let T be a 
connected graph with n vertices and n - 1 edges. By Example 2.32, T has at 
least one leaf v (say ). 

Drop the vertex v and the (single) edge incident with v. The resulting graph 
G' is still connected and has n - 1 vertices and n - 2 edges. By induction 
hypothesis. G ; is a tree. So G' has no circuits and hence G also has no circuits. 
(Addition of the edge incident with v does not create a circuit in G.) Hence G 
is a tree. By the principle of induction, the property is true for all n. 


EXAMPLE 2.34 


A person climbs a staircase by climbing either (i) two steps in a single stride 
or (ii) only one step in a single stride. Find a formula for S(n ), where Sin) 
denotes the number of ways of climbing n stairs. 

Solution 

When there is a single stair, there is only one way of climbing up. Hence 
5(1) = 1. For climbing two stairs, there are two ways, viz. two steps in a single 
stride or two single steps. So 5(2) = 2. In reaching n steps, the person can climb 
either one step or two steps in his last stride. For these two choices, the number 
of ways are Sin - 1) and S(?i - 2). 

So, 

S(n) = 5(7? - 1) + 5(7? - 2) 

Thus. 5 (h) = F(n), the nth Fibonacci number (refer to Exercise 2.20, 
at the end of this chapter). 


EXAMPLE 2.35 


How many subsets does the set {1, 2, . . .. ??} have that contain no two 
consecutive integers? 

Solution 

Let S n denote the number of subsets of {I, 2, .... n) having the desired 
property. If n = 1, 5j = |{0. {1}| = 2. If n = 2. then 5 2 = j{0, {!}. {2}j =3. 

Consider a set A with n elements. If a subset having the desired property 
contains ;?, it cannot contain n - 1. So there are 5„_ 2 such subsets. If it does 
not contain n. there are S n _\ such subsets. So 5„ = 5 7? _j + S n _ 2 . As 5i = 2 = F\ 
and S 2 - 3 = F 4 , 

Sn = F n+2 

the (n + 2)th Fibonacci number. 
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EXAMPLE 2.36 


If n > 1, show that 

1-1! + 2-2 ! + •■•+«•»! = (n + !)!-! 


Solution 

We prove the result by induction on n. If n = 1. then 1 ■ 1! = 1 = (1 + 1)! - 1. 
So there is basis for induction. 

Assume the result for n, i.e. 

M! + 2-2! + ■ ■ • + n-nl = (n + 1)! - 1 

Then, 

11! + 2-2! + • ■ • + n-nl + (n + l)-(n + 1)! 

= (n + 1)! - 1 + (n + 1)• (n + 1)! 

= (n + 1)! (1 + n + 1) - 1 = (« + 2)! - 1 

Hence the result is true for n + 1 and by the principle of induction, the 
result is true for all n > I. 


EXAMPLE 2.37 


Using induction, prove that 2" < n! for all n > 4. 

Solution 

For n = 4, 2 4 < 4!. So there is basis for induction. Assume 2" < n !. 
Then. 

2" +l = 2 n ■ 2 < nl ■ 2 < (n + l)n! = (n + 1)! 

By induction, the result is true for all n > 4. 


SELF-TEST 

Choose the correct answer to Questions 1-10: 

1. (A u A) n ( B n B) is 

(a) A (b ) A T\ B (c) B (d) none of these 

2. The reflexive-transitive closure of the relation {(1, 2), (2, 3)} is 

(a) {11, 2), (2, 3), (1, 3)} 

(b) {(1, 2), (2. 3), (1, 3), (3, 1)} 

(c) {(1, 1). (2. 2), (3, 3). (1, 3), (1, 2), (2, 3)} 

(d) {(1, 1). (2, 2), (3. 3), (1, 3)} 

3. There exists a function 


/: U. 2 


10} {2, 3, 4. 5, 6, 7, 9, 10, 11, 12} 
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which is 

(a) one-to-one and onto 

(b) one-to-one but not onto 

(c) onto but not one-to-one 

(d) none of these 

4. A tree with 10 vertices has 

(a) 10 edges (b) 9 edges (c) 8 edges (d) 7 edges. 

5. The number of binary trees with 7 vertices is 

(a) 7 (b) 6 (c) 2 (d) 1 

6. Let N = {1, 2, 3, . .Then f : N -> N defined by f(n) - n + 1 is 

(a) onto but not one-to-one 

(b) one-to-one but not onto 

(c) both one-to-one and onto 

(d) neither one-to-one nor onto 

7. QST is a substring of 

(a) PQRST 

(b) ORSTU 

(c) QSPQSTUT 

(d) QQSSTT 

8. If jc = 01, y = 101 and ^ = OIL then xyzy is 

(a) 01011011 

(b) 01101101011 

(c) 01011101101 

(d) 01101011101 

9. A binary tree with seven vertices has 

(a) one leaf 

(b) two leaves 

(c) three leaves 

(d) four leaves 

10. A binary operation o on N = {L 2. 3. . . .} is defined by a o b — 
a + 2b, Then: 

(a) o is commutative 

(b) o is associative 

(c) N has an identity element with respect to o 

(d) none of these 

EXERCISES 

2.1 If A = {ci, b} and B = {b, c}, find: 

(a) (A u 5)* 

(b) (A n B)* : 

(c) A* u B* 
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(d) A* n B* 

(e) (A ~ B)* 

(f) (.B - A)* 

2.2 Let S = {<7, £>} * For x, y e 5, define x o y = xv. i.e. x o y is obtained 
by concatenating x and y. 

(a) Is 5 closed under o? 

(b) Is o associative? 

(c) Does 5 have the identity element with respect to o? 

(d) Is o commutative? 

2.3 Let S = 2 X , where X is any nonempty set. For A, B c X, let 
A o Z? = A u 

(a) Is o commutative and associative? 

(b) Does S have the identity element with respect to o? 

(c) If A o B = A o C does it imply that B ~ Cl 

2.4 Test whether the following statements are true or false. Justify your 
answer. 

(a) The set of all odd integers is a monoid under multiplication. 

(b) The set of all complex numbers is a group under multiplication. 

(c) The set of all integers under the operation o given by a a b ~ 
a + b - ab is a monoid, 

(d) 2 s under symmetric difference y defined by A y B = (A - B) u 
(B - A) is an abelian group. 

2.5 Show 7 that the following relations are equivalence relations: 

(a) On a set S, ciRb if a - b, 

(b) On the set of all lines in the plane, lyRR if l\ is parallel to L. 

(c) On A 7 = {0, 1, 2, . .mRn if m differs from n by a multiple 
of 3. 

2.6 Show that the following are not equivalence relations: 

(a) On a set 5, aRb if a b. 

(b) On the set of lines in the plane, l\RU if l\ is perpendicular to h . 

(c) On = {0, 1, 2. . . mRn if m divides n . 

(d) On 5 = {L 2. 10}. aRb if a + b = 10. 

2.7 For x, y in {a, b }*, define a relation R by xRy if \x\ = Jy|. Show that 
X is an equivalence relation. What are the equivalence classes? 

2.8 For x, y in { a , Z?}*, define a relation X by xRy if jc is a substring of 
y (x is a substring of v if y = z\xzz for some string Z[, zi)- Is R an 
equivalence relation? 

2.9 Let R = {(1. 2). (2. 3). (L 4), (4, 2), (3, 4)}, Find R + , R*. 

2.10 Find /?* for the following relations: 

(a) R = {(I, 1), (L 2), (2, 1), (2, 3), (3, 2)} 

(b) R = {(L 1), (2. 3), (3, 4), (3. 2)} 
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(c) R - {(L 1), (2, 2), (3, 3). (4. 4)} 

(d) R = {(1, 2), (2, 3), (3, 1). (4. 4)} 

2.11 If R is an equivalence relation on 5, what can you say about R + , /?*? 

2.12 Let f:{a, b}* — > {a, b}* be given by fix) -- ax for every x € {a. b}*. 
Show that / is one-to-one but not onto. 

2.13 Let g : {a, b}* -> {a, bj* be given by g(x) = x T . Show that g is 
one-to-one and onto. 

2.14 Give an example of (a) a tree with six vertices and (b) a binary tree 
with seven vertices. 

2.15 For the tree T given in Fig. 2.18, answer the following questions: 

(a) Is F a binary tree? 

(b) Which vertices are the leaves of F? 

(c) How many internal vertices are in F? 

(d) What is the height of F? 

(e) What is the left-to-right ordering of leaves? 

(f) Which vertex is the father of 5? 

(g) Which vertices are the sons of 3? 



2.16 In a get-together, show that the number of persons who know- an odd 
number of persons is even. 

[Hint: Use a graph.] 

2.17 If X is a finite set. show that \ 2 X \ - 2—. 

2.18 Prove the following by the principle of induction: 

/ V' ,2 ”(« + l)(2n + 1) 

(£U L k = -6- 



1 _ n 

k{k + Ij (ii + 1) 


(c) 10 J;; - 1 is divisible by il for all n > 1. 
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2.19 Prove the following by the principle of induction: 

(a) 1 + 4 + 7 + ••• + (in - 2) = W - 3 ” ~ l l 

(b) 2 n > n for all n > 1 

(c) iff (2) = 2 and /( 2 k ) = 2f(2 k ~ v ) + 3, then f(2 k ) = (5/2) ■ 2* - 3. 

2.20 The Fibonacci numbers are defined in the following way: 

F(0) = I, F(l) = 1, F(n + 1) = F(n) + F(n - 1) 

Prove by induction that: 

n 

(a) F(2n + 1) = ^ F(2k) 

k=0 


(b) F(2n + 2) = X F( - 2k + 1) + 1 


2,21 Show that the maximum number of edges in a simple graph (i.e. a 
graph having no self-loops or parallel edges) is . 


2.22 If w g {a, b}* satisfies the relation abw = wab, show that \ w\ is even. 

2.23 Suppose there are an infinite number of envelopes arranged one after 
another and each envelope contains the instruction "open the next 
envelope’. If a person opens an envelope, he has to then follow the 
instruction contained therein. Show that if a person opens the first 
envelope, he has to open all the envelopes. 




The Theory of 
Automata 



In this chapter we begin with the study of automaton. We deal with transition 
systems which are more general than finite automata. We define the 
acceptability of strings by finite automata and prove that nondeterministic finite 
automata have the same capability as the deterministic automata as far as 
acceptability is concerned. Besides, we discuss the equivalence of Mealy and 
Moore models. Finally, in the last section, we give an algorithm to construct a 
minimum state automaton equivalent to a given finite automaton. 

3.1 DEFINITION OF AN AUTOMATON 

We shall give the most general definition of an automaton and later modify 
it to computer applications. An automaton is defined as a system where 
energy, materials and information are transformed, transmitted and used for 
performing some functions without direct participation of man. Examples are 
automatic machine tools, automatic packing machines, and automatic photo 
printing machines. 

In computer science the term 'automaton' means ‘discrete automaton' and 
is defined in a more abstract way as shown in Fig. 3.1. 



Fig. 3.1 Model of a discrete automaton. 
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The characteristics of automaton are now described. 

(i) Input . At each of the discrete instants of time t u U , . . ., t m the input 

values 7 b / 2 . I p , each of which can take a finite number of fixed 

values from the input alphabet X. are applied to the input side of the 
model shown in Fig. 3.1. 

(ii) Output. 0 1 , O q are the outputs of the model, each of which 

can take a finite number of fixed values from an output O. 

(iii) States . At any instant of time the automaton can be in one of the 
states q\, q 3 . . q n . 

(iv) State relation . The next state of an automaton at any instant of time 
is determined by the present state and the present input. 

(v) Output relation . The output is related to either state only or to both 

the input and the state. It should be noted that at any instant of time 
the automaton is in some state. On 'reading' an input symbol, the 
automaton moves to a next state which is given by the state relation. 

Note; An automaton in which the output depends only on the input is called 
an automaton without a memory. An automaton in which the output depends 
on the states as well, is called automaton with a finite memory. An automaton 
in which the output depends only on the states of the machine is called a 
Moore machine. An automaton in which the output depends on the state as 
well as on the input at any instant of time is called a Mealy machine. 


EXAMPLE 3.1 


Consider the simple shift register shown in Fig. 3.2 as a finite-state machine 
and study its operation. 



Fig. 3.2 A 4-bit serial shift register using D flip-flops. 


Solution 

The shift register (Fig. 3.2) can have 2 4 = 16 states (0000, 0001, . . 1111), 

and one serial input and one serial output. The input alphabet is X = {0, 1}, 
and the output alphabet is O = {0, 1}. This 4-bit serial shift register can be 
further represented as in Fig. 3.3. 
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Fig. 3.3 A shift register as a finite-state machine. 

From the operation* it is clear that the output will depend upon both the input 
and the state and so it is a Mealy machine. 

In general, any sequential machine behaviour can be represented by an 
automaton. 

3.2 DESCRIPTION OF A FINITE AUTOMATON 

Definition 3.1 Analytically, a finite automaton can be represented by a 
5-tuple (Q, L. <5. qo* F). where 

(i) Q is a finite nonempty set of states. 

(ii) I is a finite nonempty set of inputs called the input alphabet. 

(iii) 8 is a function which maps Qxl into Q and is usually called the direct 
transition function. This is the function which describes the change of 
states during the transition. This mapping is usually represented by a 
transition table or a transition diagram. 

(iv) q 0 e Q is the initial state. 

(v) F c Q is the set of final states. It is assumed here that there may be 
more than one final state. 

Note: The transition function which maps Q x £* into Q (i.e. maps a state 
and a string of input symbols including the empty string into a state) is called 
the indirect transition function. We shall use the same symbol 8 to represent 
both types of transition functions and the difference can be easily identified 
by the nature of mapping (symbol or a string), i.e. by the argument. <5 is also 
called the next state function. The above model can be represented graphically by 
Fig. 3.4. 


String being processed 



input 

tape 




Reading head 


Finite 

control 


Fig. 3,4 B lock diagram of a finite automaton. 
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Figure 3.4 is the block diagram for a finite automaton. The various 
components are explained as follows: 

(i) Input tape. The input tape is divided into squares, each square 
containing a single symbol from the input alphabet L. The end squares 
of the tape contain the endmarker (£ at the left end and the end- 
marker s at the right end. The absence of endmarkers indicates That 
the tape is of infinite length. The left-to-right sequence of symbols 
between the two endmarkers is the input string to be processed. 

(ii) Reading head. The head examines only one square at a time and can 
move one square either to the left or to the right. For further analysis, 
we restrict the movement of the R-head only to the right side. 

(iii) Finite control The input to the finite control will usually be the 
symbol under the R-head, say a, and the present state of the machine, 
say q, to give the following outputs: (a) A motion of R-head along 
the tape to the next square (in some a null move, i.e. the R-head 
remaining to the same square is permitted); (b) the next state of the 
finite state machine given by S(q, a). 


33 TRANSITION SYSTEMS 

A transition graph or a transition system is a finite directed labelled graph in 
which each vertex (or node) represents a state and the directed edges indicate 
the transition of a state and the edges are labelled with input/output. 

A typical transition system is shown in Fig. 3.5. In the figure, the initial 
state is represented by a circle with an arrow pointing towards it, the final state 
by two concentric circles, and the other states are represented by just a circle. 
The edges are labelled by input/output (e.g. by 1/0 or 1/1). For example, if the 
system is in the state q$ and the input 1 is applied, the system moves to state 
qi as there is a directed edge from q 0 to q j with label 1/0. It outputs 0. 



0/0 

Fig. 3.5 A transition system. 


We now give the (analytical) definition of a transition system. 

Definition 3.2 A transition system is a 5-tuple ( Q , E, 5, Q 0 , F ), w'here 

(i) Q, Z and F are the finite nonempty set of states, the input alphabet, 
and the set of final states, respectively, as in the case of finite automata; 

(ii) Q 0 c Q , and O 0 is nonempty: and 

(iii) 5 is a finite subset of Q x £* x Q. 
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In other words, if (q h vt\ q 2 ) is in <5, it means that the graph starts at the 
vertex q b goes along a set of edges, and reaches the vertex q 2 . The 
concatenation of the label of all the edges thus encountered is w. 

Definition 3.3 A transition system accepts a string w in X* if 

(i) there exists a path which originates from some initial state, goes 
along the arrows, and terminates at some final state; and 

(ii) the path value obtained by concatenation of all edge-labels of the path 
is equal to w. 


Example 3.2 


Consider the transition system given in Fig. 3.6. 


1/0 



Determine the initial states, the final states, and the acceptability of 101011, 
111010 . 

Solution 

The initial states are q b and q b There is only one final state, namely q 3 . 

The path-value of <7o</o#2(73 is 101011. As q 3 is the final state, 101011 is 
accepted by the transition system. But, 111010 is not accepted by the transition 
system as there is no path with path value 111010. 

Note: Every finite automaton ( Q , X. 8, g 0 , F) can be viewed as a transition 
system ( Q , X, 8' Q 0 , F) if we take Qq = {<7 0 } and 8 / = {{q y w, 8{q y w))\q e 
0 , w e X*}. But a transition system need not be a finite automaton. For 
example, a transition system may contain more than one initial state. 

3.4 PROPERTIES OF TRANSITION FUNCTIONS 

Property 1 8(q, A) = q is a finite automaton. This means that the state of the 

system can be changed only by an input symbol. 
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Property 2 For ail strings w and input symbols a, 

Siq, aw) = 8(8(q f a), w) 

S(q, \va) - 8(8(q f w), a) 

This property gives the state after the automaton consumes or reads the 
first symbol of a string aw and the state after the automaton consumes a prefix 
of the string wa. 


EXAMPLE 3.3 


Prove that for any transition function 8 and for any two input strings x and y, 

8(q, xy) = 8(8(q, a), y) (3.1) 

Proof By the method of induction on | y |, i.e. length of y. 

Basis: When | v | = 1, y = a e I 

L.H.S. of (3.1)= 8(q, xa) 

= 8(8 (q, x), a ) by Property 2 

= R.H.S. of (3.1) 

Assume the result, i.e. (3.1) for all strings x and strings y with | y | = n. Let 
y be a string of length n + 1. Write y = >'[<3 where j y x | = n. 

L.H.S. of (3.1)= S(q, xy { a) = 8(q, x,a), .V; = xyy 

= 8{8(q, .V]), a) by Property 2 

= 8(8(q, xv’i), a) 

= S(S(8(q, x), _yj), a) by induction hypothesis 

R.H.S. of (3.1)= 8(S(q, x), y,n) 

= 8(8(8(q, x), yO, a) by Property 2 

Hence. L.H.S. = R.H.S. This proves (3.1) for any string y of length n + 1. 
By the principle of induction. (3.1) is true for all strings. I 


EXAMPLE 3.4 


Prove that if 8(q, x) = 8(q, y), then 8(q, xz) = 8(q, yz) for all strings c in If. 

Solution 

8(q, xz) = 8(8(q, x), z) bv Example 3.3 

= 8(8(q, y), z) (3.2) 


By Example 3.3. 


8(q, yz) = 8(8(q, y), z) 
= 8( q, xz) 


(3.3) 
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3.5 ACCEPTABILITY OF A STRING BY A FINITE 
AUTOMATON 

Definition 3.4 A string x is accepted by a finite automaton 

M - (ft Z, 5 <7o F) 

if 5 (</q, x) = q for some q e F. 

This is basically the acceptability of a string by the final state. 

Note: A final state is also called an accepting state. 


EXAMPLE 3.5 


Consider the finite state machine whose transition function 8 is given by Table 3.1 
in the form of a transition table. Here, Q = { q 0 , q h cjy }, Z = {0, 1}, 
F = {< 7 0 }. Give the entire sequence of states for the input string 110001. 


TABLE 3.1 Transition Function Table for Example 3.5 


State 

0 

Input 

1 

@ 

Q2 


Q\ 


Qs 


Qo 

Q2 

Qq 


Qs 

<h 

<7i 


Q2 


Solution 


Hence, 


1 1 

<%j, 110101)= 8{q u 10101) 

I 

= 5(t/ 0 ,0101) 

‘ i 

= S(q% 101) 

I 

= < 5 ( 93 , 01 ) 

i 

= S(q h 1) 

= S(q 0 , A) 

= Qi) 


i i o i o 1 
% Q\ G 2 c h % 

The symbol i indicates that the current input symbol is being processed by the 
machine. 
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3.6 NONDETERMINISTIC FINITE STATE MACHINES 

We explain the concept of nondeterministic finite automaton using a transition 
diagram (Fig. 3.7). 



Fig. 3.7 Transition system representing nondeterministic automaton. 


If the automaton is in a state {c/ 0 } and the input symbol is 0, what will be 
the next state? From the figure it is clear that the next state will be either {q 0 } 
or Thus some moves of the machine cannot be determined uniquely by 
the input symbol and the present state. Such machines are called 
nondeterministic automata, the formal definition of which is now given. 

Definition 3.5 A nondeterministic finite automaton (NDFA) is a 5-tuple 
(Q f I, S y cjq, F), where 

(i) Q is a finite nonempty set of states; 

(ii) X is a finite nonempty set of inputs; 

(iii) 8 is the transition function mapping from Qxl into which is the 
power set of Q, the set of all subsets of Q\ 

(iv) q 0 e Q is the initial state; and 

(v) F c Q is the set of final states. 

We note that the difference between the deterministic and nondeterministic 
automata is only in 8. For deterministic automaton (DFA), the outcome is a 
state, i.e. an element of Q\ for nondeterministic automaton the outcome is a 
subset of Q. 

Consider, for example, the nondeterministic automaton whose transition 
diagram is described by Fig. 3.8. 

The sequence of states for the input string 0100 is given in Fig. 3.9. Hence, 
<5(<7 o> 0100) = {<7o> <?3> <74} 

Since q 4 is an accepting state, the input string 0100 will be accepted by 
the nondeterministic automaton. 
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Fig. 3.8 Transition system for a nondeterministic automaton. 



<74 

Fig. 3.9 States reached while processing 0100. 


Definition 3.6 A string w e X* is accepted by NDFA M if S(qo, w) contains 
some final state. 

Note: As M is nondeterministic, S(q Q , w) may have more than one state. So 
w is accepted by M if a final state is one among the possible states that M can 
reach on application of u\ 

We can visualize the working of an NDFA M as follows: Suppose M 
reaches a state q and reads an input symbol a . If 8(q, a) has n elements, 
the automaton splits into n identical copies of itself; each copy pursuing 
one choice determined by an element of 8(q , a). This type of parallel 
computation continues. When a copy encounters ( q , a) for which S(q, a) = 0, 
this copy of the machine ‘dies’; however the computation is pursued by the 
other copies. If any one of the copies of M reaches a final state after 
processing the entire input string n\ then w^e say that M accepts w. Another 
way of looking at the computation by an NDFA M is to assign a tree structure 
for computing S(q, w). The root of the tree has the label q . For every input 
symbol in w, the tree branches itself. When a leaf of the tree has a final state 
as its label, then M accepts u\ 

Definition 3.7 The set accepted by an automaton M (deterministic or 
nondeterministic) is the set of all input strings accepted by M. It is denoted by 
T(M). 
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3.7 THE EQUIVALENCE OF DFA AND NDFA 

We naturally try to find the relation between DFA and NDFA. Intuitively we 
now feel that: 

(i) A DFA can simulate the behaviour of NDFA by increasing the number 
of states. (In other words, a DFA (Q, X, S, q Q , F) can be viewed as an 
NDFA (Q, X, S', q 0 , F) by defining S'(q, a) = {S(q, a)}.) 

(ii) Any NDFA is a more general machine without being more powerful. 

We now give a theorem on equivalence of DFA and NDFA. 

Theorem 3.1 For every NDFA, there exists a DFA which simulates the 
behaviour of NDFA. Alternatively, if L is the set accepted by NDFA, then there 
exists a DFA which also accepts L. 

Proof Let M = (0 X, <5, q 0 , F) be an NDFA accepting L We construct a DFA 
M' as: 

Af = (Q\ X, 5, qo, n 

where 

(i) Q' = (any state in Q' is denoted by [q b q 2 . .. qj\, where q b q 2 , 
■ • ■■ qj € 0; 

(ii) = [r/ 0 ]; and 

(iii) F' is the set of all subsets of Q containing an element of F. 

Before defining S\ let us look at the construction of Q', q'o and F'. M is 
initially at q Q . But on application of an input symbol, say a, M can reach any 
of the states 8(qo, a). To describe M, just after the application of the input 
symbol a, we require all the possible states that M can reach after the 
application of a . So, M' has to remember all these possible states at any instant 
of time. Hence the states of M/ are defined as subsets of Q. As M starts with 
the initial state q 0 , q' 0 is defined as [gol- A string w belongs to T(M) if a final 
state is one of the possible states that M reaches on processing w. So, a final 
state in M / (i.e. an element of F') is any subset of Q containing some final 
state of M. 

Now we can define S': 

(iv) S'([qi, <72 . qil> a) = 8(q b a) u 8(q 2 , a) u . . , u 8(q j} a). 

Equivalently, 

<5'([<7b qi, • ■ •• gii, a) = \p u .. p,] 

if and only if 

<5({<?i. • •<7;h a) = {p 1( p 2 , ■ ■Pj). 

Before proving L = T(M'), wt prove an auxiliary result 

SXq'o, x) = [q y ..q-X (3-4) 

if and only if 8(q 0 , x ) = {q u . *<?;} for all x in X*. 

We prove by induction on \x j, the ‘if part, i.e. 

S'(q'o. .v) = [q h c7 2 , .. .. qi\ 
if 5(q 0 , x ) = {q x . q;}. 


(3.5) 
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When |a-| = 0. S(q 0f A) = {< 7 0 }. and by definition of S', 8'(q' 0 , A) = 
Qo = fool So. (3.5) is true for x with |a{ = 0. Thus there is basis for induction. 

Assume that (3.5) is true for all strings y with \y\ < m. Let x be a 
string of length m + 1. We can write x as ya, where |yj = m and a e S. Let 
8{q (h y) = {pu - • v Pj] and S(q 0 , ya) = {r 1? r 2 , . . .. /*}. As |vj < by 
induction hypothesis we have 

<5 Wo- v) = |/>i, • . A-] (3.6) 

Also. 

{rj, 73 . . .. r k ] = 8(q 0 , ya) = 8(S(q 0 , y), nr) = < 5 ({p,,.. . ., p,-}, nr) 

By definition of <5' 

8'([Pi, • • • * P/1 <*) = tTi* * • v **] (3.7) 

Hence. 

S'iq'-y ya) = 8'(8'(q^, y) t a) = ^([p], . . pj. c/) by (3.6) 

= Dv • ■ ** b y (3*7) 

Thus we have proved (3.5) for x = ya , 

By induction, (3.5) is true for all strings x. The other part (i.e. the ‘only if' 
part), can be proved similarly, and so (3.4) is established. 

Now, x e T(M) if and only if S(q, a) contains a state of F. By (3.4). 
8(q 0 , a) contains a state of F if and only if 8'(q'$ f x) is in F\ Hence, x e T(M) 
if and only if x s T(M'). This proves that DFA M' accepts L I 

Note: In the construction of a deterministic finite automaton equivalent to 
a given nondeterministic automaton M, the only difficult part is the construction 
of (V for Mi. By definition, 

k 

8'([q\ ■ • • q *], «) = U 8(q r a) 

So we have to apply <5 to (q h a) for each i = 1.2, . . k and take their 
union to get <5'([<7 i . . . qf\ t a). 

When <5 for M is given in terms of a state table, the construction is simpler. 
S(q h a) is given by the row corresponding to q x and the column corresponding 
to a. To construct S'([q] ... q k ]. a ), consider the states appearing in the rows 
corresponding to q\. .... q k , and the column corresponding to a. These states 
constitute 8\[q { . . . q k ], a). 

Note: We write 8' as 5 itself when there is no ambiguity. We also mark the 
initial state with -» and the final state with a circle in the state table. 


EXAMPLE 3.6 


Construct a deterministic automaton equivalent to 

M = ({<7o» <7i}- {0- 1}. <5, q Q , {<?<)}) 

where 8 is defined by its state table (see Table 3.2). 
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TABLE 3.2 State Table for Example 3.6 


State/1 

0 

■i 


<?o 

<7i 

<7i 

Qi 

<7o> Q\ 


Solution 

For the deterministic automaton M h 

(i) the states are subsets of {q Qy q x ), i.e. 0. [q 0 ], [q 0 , \q{\\ 

(ii) [g 0 ] is the initial state; 

(iii) [g 0 ] and [q§, r/ 3 ] are the final states as these are the only states 
containing q 0 : and 

(iv) § is defined by the state table given by Table 3.3. 

TABLE 3.3 State Table of for Example 3.6 


State/1 

0 

1 

0 

0 

0 

too] 

too] 

toi] 

toil 

toil 

too, <7iJ 

too.- 9i] 

too, q-,] 

too, <7ii 


The states q 0 and q A appear in the rows corresponding to q 0 and q x and the 
column corresponding to 0. So, <5([g 0 . <r/j] 5 0) = [# 0 , q{\. 

When M has n states, the corresponding finite automaton has 2 n states. 
However, w 7 e need not construct <5 for all these 2 n states, but only for those 
states that are reachable from [g 0 ]. This is because our interest is only in 
constructing M x accepting T{M). So, we start the construction of 8 for [q§\. We 
continue by considering only the states appearing earlier under the input 
columns and constructing 8 for such states. We halt when no more new states 
appear under the input columns. 


EXAMPLE 3.7 


Find a deterministic acceptor equivalent to 

M = ({tf 0 > <7i> <7 2 }> {a, b }, 8, q 0 , {</ 2 }) 
where <5 is as given by Table 3.4. 

TABLE 3.4 State Table for Example 3.7 

State 11 a b 

-><7o Qo, <7i <72 

<7i Qo <7i 

(jfe) <7o, Qi 
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Solution 

The deterministic automaton M x equivalent to M is defined as follows: 

M, = (22, -{a, b}, 8. [q 0 ], F") 

where 

F = lihil [%■ 4;]- [q\- qil [Qo- <7i- TiVi 

We start the construction by considering [< 70 ] first. We get [ q 2 ] and [<?o Qil* Then 
we construct 5 for [q 2 ] and [<70- < 7 i]- Uj\, Q2] is a r} ew state appearing under the 
input columns. After constructing 5 for [q\, q 2 ], we do not get any new states 
and so we terminate the construction of 8 . The state table is given by Table 3.5. 


TABLE 3.5 State Table of for Example 3,7 


State II 

a 

b 

[Qo] 

[Qo, Qil 

[Q 2 ] 

[%] 

0 

[Qa, Qil 

[Qo, Qd 

[Qo, Qil 

[Qi, Q 2 ] 

[Qi, 172] 

[Qol 

[Qo, Qil 


EXAMPLE 3.8 


Construct a deterministic finite automaton equivalent to 

M = ({q<> qu qy q?}, {0, i}), S. q 0 , { q 2 }) 

where 8 is given by Table 3.6. 


TABLE 3.6 State Table for Example 3.8 


State/I 

a 

b 

->Qo 

Qo= Qi 

Qo 

Qi 

Q2 

Qi 

Q2 

Q3 

Q3 

© 


Q 2 


Solution 

Let Q = {g 0 , q h q y q 3 }. Then the deterministic automaton M\ equivalent to 
M is given by 

Mi = (2 2 , {a, h}. 8, [q 0 ], F) 

where F consists of; 


tel- too- q?l t<7b q? J- [<?2- q?\- [<?o- q\, q?], [qo, <?:• 9?]. lq\, c ii- 43] 


and 


[q 0 . q u q 2 , 43 ] 


and where <5 is defined by the state table given by Table 3.7. 



84 a Theory of Computer Science 


TABLE 3.7 State Table of M 1 for Example 3.8 

State /1 

a 

h 

.fool 

too- c/il 

tod 

[£7o, Qil 

too. <7i. Qil 

too. <7iJ 

[Qo- 9i. %] 

too. <?i, <72. Qol 

t<7o. <7i. %] 

too. m. 

too. <?i. <h] 

too. <7i. %! 

too. Qi. <72, %] 

[(7o. <7i, <72, ft] 

too. <7i. <72..<7al 


3.8 MEALY AND MOORE MODELS 
3.8.1 Finite Automata with Outputs 

The finite automata which we considered in the earlier sections have binary 
output, i.e. either they accept the string or they do not accept the string. This 
acceptability was decided on the basis of reachability of the final state by the 
initial state. Now. we remove this restriction and consider the model where the 
outputs can be chosen from some other alphabet. The value of the output 
function Z(t) in the most general case is a function of the present state q(j) and 
the present input x(t\ i.e. 

Z(t) = Mq{r), x(t)) 

where A is called the output function. This generalized model is usually called 
the Mealy machine . If the output function Z(t) depends only on the present state 
and is independent of the current input, the output function may be written as 

Z(t) = Mo(f)) 

This restricted model is called the Moore machine . It is more convenient to use 
Moore machine in automata theory. We now give the most general definitions 
of these machines. 

Definition 3.8 A Moore machine is a six-tuple ( 0 . £, A. <5, A, q 0 ). where 

(i) 0 is a finite set of states: 

(ii) Z is the input alphabet; 

(iii) A is the output alphabet: 

(iv) 8 is the transition function Z x 0 into Q ; 

(v) A is the output function mapping Q into A; and 

(vi) < 7 0 is the initial state. 

Definition 3.9 A Mealy machine is a six-tuple (Q, E, A, <5, A> q$) f where 
all the symbols except A have the same meaning as in the Moore machine. A 
is the output function mapping I x Q into A. 

For example. Table 3,8 describes a Moore machine. The initial state q 0 is 
marked with an arrow. The table defines 8 and A, 
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TABLE 3.8 

A Moore Machine 


Present state 

Next state 6 

Output 


a = 0 

a = 1 

X 

“*9o 

93 

9^ 

0 

9i 

9i 

92 

1 

92 

92 

9s 

0 

93 

93 

9o 

0 


For the input string 0111, the transition of states is given by q 0 -4 q 3 -4 
q 0 —> ^ 7 i —> cjy The output string is 00010. For the input string A, the output 
is Mcjo) = 0. 

Transition Table 3.9 describes a Mealy machine. 


TABLE 3.9 A Mealy Machine 


Present state 


Next state 




a = 0 



a = 1 


state 

output 

state 


output 

-«7i 

93 

0 

92 


0 

92 

9 J 

1 

94 


0 

93 

92 

1 

9-! 


1 

94 

94 

1 

93 


0 


Note: For the input stung 0011. the transition of states is given by q x -4 q 3 
-4 q 2 -4 q 4 -4 q 3 , and the output string is 0100. In the case of a Mealy machine, 
we get an output only on the application of an input symbol. So for the input 
string A, the output is only A. It may be observed that in the case of a Moore 
machine, we get A(c/ 0 ) for the input string A. 

Remark A finite automaton can be converted into a Moore machine by 
introducing A = {0, 1} and defining k(q) = 1 if q e F and Mq) = 0 if 
q £ F. 

For a Moore machine if the input string is of length /?, the output string 
is of length n + 1. The first output is X(qq) for all output strings. In the case 
of a Mealy machine if the input string is of length n, the output string is also 
of the same length n. 


3.8,2 Procedure for Transforming a Mealy Machine 
into a Moore Machine 

We develop procedures for transforming a Mealy machine into a Moore 
machine and vice versa so that for a given input string the output strings are the 
same (except for the first symbol) in both the machines. 
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EXAMPLE 3.9 


Consider the Mealy machine described by the transition table given by 
Table 3.10. Construct a Moore machine which is equivalent to the Mealy, 
machine. 


TABLE 3.10 Mealy Machine of Example 3.9 


Present state 


Next 

state 


Input a = 

0 

Input a = 1 

state 

~~output 

state 

output 


Qz 

0 

<72 

0 

<72 

<?i 

1 

<74 

0 

Qz 

<72 

1 

<7i 

1 

<74 

Q4 

1 

<73 

0 


Solution 

At the first stage we develop the procedure so that both machines accept 
exactly the same set of input sequences. We look into the next state column 
for any state, say q r and determine the number of different outputs associated 
with q; in that column. 

We split q i into several different states, the number of such states being 
equal to the number of different outputs associated with c l{ . For example, in this 
problem. q { is associated with one output 1 and q 2 is associated with two 
different outputs 0 and 1. Similarly, q 3 and q 4 are associated with the outputs 
0 and 0. 1. respectively. So. we split q 2 into q 2 $ and q 2V Similarly, q 4 is split 
into q ±o and q 4l . Now Table 3.10 can be reconstructed for the new states as 
given by Table 3,11. 


TABLE 3.11 State Table for Example 3.9 


Present state 


Next , 

state 


Input a - 

0 

Input a - 1 


state 

output 

state 

output 

— 

<73 

0 

<?20 

0 

<720 

<7i 

A 

\ 

<?40 

0 

<721 

<7i 

t 

<?40 

0 

<73 

<721 

1 

<7l 

1 

<740 

<741 

1 

<73 

0 

<741 

<741 

4 

\ 

<73 

0 


The pair of states and outputs in the next state column can be rearranged as 
given by Table 3.12. 







Chapter 3: The Theory of Automata B 87 


TABLE 3.12 Revised State Table for Example 3.9 

Present state 

Next 

state 

Output 


a = 0 

a = 1 



93 

920 

1 

920 

9i 

940 

0 

<fcl 

9i 

940 

1 

93 

921 

9i 

0 

940 

941 

93 

n__ 

941 

Qa'. 

93 

i 


Table 3.12 gives the Moore machine. Here we observe that the initial state 
cji is associated with output 1. This means that with input A we get an output 
of 1, if the machine starts at state Thus this Moore machine accepts a zero- 
length sequence (null sequence) which is not accepted by the Mealy machine. 
To overcome this situation, either we must neglect the response of a Moore 
machine to input A, or we must add a new starting state q 0 , whose state 
transitions are identical with those of q x but whose output is 0. So Table 3.12 
is transformed to Table 3.13. 


TABLE 3.13 Moore Machine of Example 3.9 


Present state 

Next 

state 

Output 


a = 0 

a = 1 


“*9g 

9s 

920 

0 

9i 

9s 

920 

1 

920 

9i 

940 

0 

921 

9i 

940 

1 

9s 

921 

9i 

0 

940 

941 

9s 

0 

941 

941 

9s 

1 


From the foregoing procedure it is clear that if we have an m-output, n- 
state Mealy machine, the corresponding m -output Moore machine has no more 
than mn + 1 states. 


3.8.3 Procedure for Transforming a Moore Machine 
into a Mealy Machine 

We modify the acceptability of input string by a Moore machine by neglecting 
the response of the Moore machine to input A. We thus define that Mealy 
Machine M and Moore Machine M' are equivalent if for all input strings w, 
b7 X f(w) = Zy/'(vr), where h is the output of the Moore machine for its initial 
state. We give the following result: Let M\ = (Q, Z, A, <5, A, q 0 ) be a Moore 
machine. Then the following procedure may be adopted to construct an 
equivalent Mealy machine My 
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Construction 

(i) We have to define the output function X for the Mealy machine as 
a function of the present state and the input symbol. We define X by 

X{q y a) - MS(t} y a)) for all states q and input symbols a. 

(ii) The transition function is the same as that of the given Moore 
machine. 


EXAMPLE 3.10 


Construct a Mealy Machine which is equivalent to the Moore machine given 
by Table 3.14. 


TABLE 3.14 Moore Machine of Example 3.10 


Present state 

Next 

state 

Output 


a = 0 

a = 1 



9s 

9i 

0 

9i 

9i 

92 

1 

92 

92 

93 

0 

9s 

9s 

9o 

0 


Solution 

We must follow the reverse procedure of converting a Mealy machine into a 
Moore machine. In the case of the Moore machine, for every input symbol we 
form the pair consisting of the next state and the corresponding output and 
reconstruct the table for the Mealy Machine. For example, the states q 3 and c/| 
in the next state column should be associated with outputs 0 and 1, respectively. 
The transition table for the Mealy machine is given by Table 3.15. 


TABLE 3.15 Mealy Machine of Example 3.10 


Present state 



Next state 



a = 0 



a = 1 


state 

output 

state 


output 

-*Qq 

9s 

0 

9i 


1 

9; 

9i 

1 

92 


0 

92 

92 

0 

9s 


0 

93 

93 

0 

9o 


0 


Note: We can reduce the number of states in any model by considering states 
with identical transitions. If two states have identical transitions (i.e, the rows 
corresponding to these two states are identical), then we can delete one of them. 




Consider the Moore machine described by the transition table given by 
Table 3.16. Construct the corresponding Mealy machine. 





Chapter 3; The Theory of Automata B 89 


TABLE 3.16 Moore Machine of Example 3.11 

Present state 

Next state 

Output 


a = 0 a=1 


~rO A 

9i 92 

0 

92 

9i 9s 

0 

9s 

q. 92 

1 


Solution 

We construct the transition table as in Table 3.17 by associating the output 
with the transitions. 

In Table 3.17. the rows corresponding to q 2 and q 2 are identical So. we can 
delete one of the two states, re. q 2 or r/ 3 . We delete q 2 . Table 3.18 gives the 
reconstructed table. 


TABLE 3.17 Transition Table for Example 3.11 


Present state 



Next state 


a = 

0 

a = 1 


state 

output 

state 

output 

-*9i 

9i 

0 

92 

0 

92 

9i 

0 

93 

1 

93 

9i 

0 

93 

1 


TABLE 3.18 Mealy Machine of Example 3.11 


Present state 

A/ex? s?a?e 



a - 0 


a 

= i 

state 

output 

state 

output 

—>9i 9i 

0 

92 

0 

9 2 9i 

0 

92 

1 

In Table 3.18. we have deleted the 

rows. 

r/ 3 -row and replac 

ed by { 

r /2 in the other 


EXAMPLE 3.12 


Consider a Mealy machine represented by Fig. 3.10. Construct a Moore 
machine equivalent to this Mealy machine. 






1 /z 2 

Fig. 3.10 Mealy machine of Example 3.12. 


Solution 

Let us convert the transition diagram into the transition Table 3.19. For the 
given problem: q x is not associated with any output; q 2 is associated with two 
different outputs Z l and Z 2 ; q 3 is associated with two different outputs Z { and 
Z 2 . Thus we must split q 2 into q 2x and q 22 with outputs Zj and Z 2 . respectively 
and q 3 into q 3l and q 32 with outputs Z } and Z 2 , respectively. Table 3.19 may be 
reconstructed as Table 3.20. 


TABLE 3,19 Transition Table for Example 3.12 


Present state 



Next state 




a = 0 



a = 1 


state 


output 


stare output 

— ^<7i 

<72 


Z 1 


q-i Zy 

<72 

<72 


z 2 


<73 ^ 

<73 

<72 


Zl 


<73 Z 2 

TABLE 3.20 

Transition Table of Moore 

Machine for Example 3.12 

Present state 



Next state 


Output 



a - 0 


a = 1 




<721 


<731 


<72; 


<722 


<731 

Z 1 

<722 


<722 


<731 

Z 2 

<731 


<721 


<732 

Z, 

<732 


<721 


<?32 

Z 2 


Figure 3.11 gives the transition diagram of the required Moore machine. 



Chapter 3; The Theory of Automata S 91 



Fig. 3.11 Moore machine of Example 3.12. 


3.9 MINIMIZATION OF FINITE AUTOMATA 

In this section we construct an automaton with the minimum number of states 
equivalent to a given automaton M. 

As our interest lies only in stnngs accepted by M, what really matters is 
whether a state is a final state or not. We define some relations in Q. 

Definition 3.10 Tw 7 o states q x and q 2 are equivalent (denoted by q x s q 2 ) if 
both 8{q h x) and S(q 2 , x) are final states, or both of them are nonfinal states 
for all x e X*. 

As it is difficult to construct S(q h x) and S(q 2f x ) for all a: e X* (there 
are an infinite number of strings in X*)* we give one more definition. 

Definition 3.11 Two states q x and q 2 are ^'-equivalent (k > 0) if both 
8(q h x) and S(q 2 . x) are final states or both nonfinal states for all strings x 
of length k or less. In particular, any tw r o final states are 0-equivalent and any 
two nonfinal states are also 0-equivalent. 

We mention some of the properties of these relations. 

Property 1 The relations we have defined, i.e. equivalence and ^-equivalence, 
are equivalence relations, i.e. they are reflexive, symmetric and transitive. 

Property 2 By Theorem 2.1. these induce partitions of Q. These partitions 
can be denoted by 7rand n k * respectively. The elements of n k are ^-equivalence 
classes. 

Property 3 If q x and q 2 are ^-equivalent for all k > 0, then they are equivalent. 
Property 4 If q x and q 2 are (k + l)-equivalent. then they are ^-equivalent. 

Property 5 K n = n n+x for some n. (7t n denotes the set of equivalence classes 
under n -equivalence.) 

The following result is the key to the construction of minimum state 
automaton. 

RESULT Two states q x and q 2 are (k + l)-equivalent if (i) they are 
^-equivalent; (ii) 8{q x , a) and Siq^, a) are also ^-equivalent for every a € X. 
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Proof We prove the result by contradiction. Suppose q x and q 2 are not 
(k + l)-equivalent. Then there exists a string w = owj of length k+ 1 such that 
<5(<7 j, awj) is a final state and Siq^, aw is not a final state (or vice versa; the 
proof is similar). So S(S(q b a), w } ) is a final state and S(8(q 2 , a), w { ) is not 
a final state. As w, is a string of length k , a) and S(q 2 , a ) are not 

^-equivalent. This is a contradiction, and hence the result is proved. I 

Using the previous result we can construct the (k + Inequivalence classes 
once the ^-equivalence classes are known. 


3.9.1 Construction of Minimum Automaton 

Step 1 (Construction of %)* Bv definition of O-equivalence, n ft = {Q\ ,Q 2 ) 
where Q® is the set of all final states and Q® = Q - Q x • 

Step 2 (Construction of K k+X from /r /: ). Let Q k be any subset in K k . If q } and 
q 2 are in Q k , they are ( k + l)-equivalent provided 8(q b a) and 8(q % a) are 
^-equivalent. Find out whether 8(q b a ) and S(q 2) a) are in the same equivalence 
class in n k for every' a e X. If so, q x and q 2 are (k + 1 )-equivalent. In this way, 
Q k is further divided into ( k 4 - Inequivalence classes. Repeat this for every Q k 
in K k to get all the elements of K k+X . 

Step 3 Construct n n for n = 12..,.. until K n = n n +\. 

Step 4 (Construction of minimum automaton). For the required minimum 
state automaton, the states are the equivalence classes obtained in step 3, i.e. the 
elements of K 1V The state table is obtained by replacing a state q by the 
corresponding equivalence class [q]. 

Remark In the above construction, the crucial part is the construction of 
equivalence classes; for, after getting the equivalence classes, the table for 
minimum automaton is obtained by replacing states by the corresponding 
equivalence classes. The number of equivalence classes is less than or equal to 
| Q |. Consider an equivalence class [q x ] = {q x , If q\ is reached 

while processing w x w 2 e T(M) with 8(q 0f w x ) = q b then 8{q x , w 2 ) e F. So, 
8{q u w 2 ) e F for i - 2, . . h Thus we see that q b i = 2, . . k is reached on 
processing some w e T{M) iff q x is reached on processing w, i.e. q x of [q x ] can 
play the role of q 2 . . . q k . The above argument explains why we replace a state 
by the corresponding equivalence class. 

Note: The construction of 7Zq, k b n 2 , etc. is easy when the transition table 
is given. 7Cq = {Q x \ 0 2 }< where Q x = F and Q 2 = Q - F. The subsets in 
K\ are obtained by further partitioning the subsets of % If q b q 2 s O x , 
consider the states in each a-column. where a e X corresponding to q x and q 2 . 
If they are in the same subset of q x and q 2 are 1-equivalent. If the states 
under some a-column are in different subsets of ;r 0 . then q x and q 2 are not 
1-equivalent. In general, (k + l)-equivalent states are obtained by applying the 
above method for q { and q 2 in Qf. 
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EXAMPLE 3.13 


Construct a minimum state automaton equivalent to the finite automaton 
described by Fig. 3.12. 



Solution 

It will be easier if we construct the transition table as shown in Table 3.21. 


TABLE 3.21 Transition Table for Example 3.13 


Staten. 

0 

1 

~>Qc 

9i 

95 

9i 

96 

92 

® 

9o 

92 

93 

92 

96 

94 

97 

95 

9s 

92 

96 

9s 

96 

94 

9? 

96 

92 


By applying step 1, we get 

Q\ = F = {ft}. Qi = Q - Q ? 

So. 

ft = {{ft}- {ft- ft- ft. ft. ft- ft- ft}} 
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The {q 2 } in cannot be further partitioned. So, Q\ = { 97 }, Consider q 0 and 
q x e Q? . The entries under the 0-column corresponding to q 0 and q x are q } 
and q (y : they lie in Q 2 - The entries under the 1-column are 95 and q 2 . q 2 € 
Q? and q 5 e Q 2 . Therefore, q 0 and q x are not 1-equivalent. Similarly. q 0 is 
not 1 -equivalent to q 3> q 5 and q 7 . 

Now, consider q 0 and 94 . The entries under the 0-column are q x and 97. 
Both are in Q 2 . The entries under the 1-column are q 5 , q 5 . So c / 4 and q 0 are 
1-equivalent. Similarly. q Q is 1 -equivalent to q 6 . {90 94 , q 6 } is a subset in K\. 
So. = {?o, 94 , 96 }• 

Repeat the construction by considering q x and any one of the states 
q 3f q 5 . q 7 . Now, q x is not 1-equivalent to 93 or q 5 but 1-equivalent to q 7 . Hence, 
Q 3 = {q b q 7 }> The elements left over in Q 2 are q 3 and q 5 . By considering the 
entries under the 0 -column and the 1 -column, we see that 93 and 95 are 
1-equivalent. So Q 4 = { 93 , 95 }. Therefore, 

{%• <74. <76/- {<?!• 9?}. {<??• ft}} 

The { 92 } is also in k 2 as it cannot be further partitioned. Now. the entries 
under the 0-column corresponding to q 0 and q A are q x and q ly and these lie 
in the same equivalence class in K x . Tire entries under the 1-column are 95, 
q 5 . So 90 and q, x are 2-equivalent. But q 0 and q 6 are not 2-equivalent. Hence. 
{q 0> 94, 9 6 } is partitioned into {cj 0 , 94} and {q 6 }. q x and 97 are 2-equivalent. 
93 and 95 are also 2-equivalent. Thus. n 2 = {{92}, {90 94 ^ (960 ( 9 u 97 K 
{93, 95}}. 9 0 and 94 are 3-equivalent. The q x and 97 are 3-equivalent. Also, 
93 and 95 are 3-equivalent. Therefore. 

^3 = H 9 :}> { 9 o 94}. { 9 e)> {<?i> 97 h ( 93 > 95}} 

As 7 r 2 = 7 T 3 , tt 2 gives us the equivalence classes, the minimum state automaton 
is 

Af' = (2: {0. 1}. 5: 9o ? FO 

where 

G' = {[92]- [ 9 o> 94] > [Qel 1 . 9 b qil [Qi* 95]} 

9 o = [ 9 o> 94]- F'= [9:] 

and 5' is defined by Table 3.22. 


TABLE 3.22 Transition Table of Minimum State 
Automaton for Example 3.13 


StatelZ 

0 

1 

[< 7 o. Q4] 

[c?1. <77] 

toe- < 7 s] 

iQi, Qi] 

toe] 

tod 

tod 

too. <74 

tod 

to- <75] 

tod 

toe] 

m 

toei 

too. <74] 
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Note: The transition diagram for the minimum state automaton is described 
by Fig. 3.13. The states q 0 and q 4 are identified and treated as one state. 
(So also are q h q 1 and q 3 , q 5 ) But the transitions in both the diagrams (he. 
Figs. 3.12 and 3.13) are the same. If there is an arrow from to q; with label 
a . then there is an arrow from [#,■] to [qj\ with the same label in the 
diagram for minimum state automaton. Symbolically, if &q v a) = qj, then 
S'([q^ a) = [qj]. 



Fig. 3.13 Minimum state automaton of Exampie 3.13. 


EXAMPLE 3.14 


Construct the minimum state automaton equivalent to the transition diagram 
given by Fig. 3.14. 



Solution 

We construct the transition table as given by Table 3.23. 
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TABLE 3.23 Transition Table for Example 3.14 


State11 

a 

b 

q 0 

to 

to 

to 

to 

to 

to 

to 

to 

® 

to 

to 

to 

to 

to 

to 

to 

to 

to 

to 

to 

to 

Qe 

to 


Since there is only one final state q 3 , < 2 i° = {to?}, 0 2 ° = Q ~ 0 i° Hence, 
flb = {{<7 3 }, {g 0 , <7i, to to to to toll- As {a 3 } cannot be partitioned further, 
O' 1 = {<73}. Now q 0 is 1-equivalent to q\, <75, to t> ut not t0 to» to <?7, m d so 
O': = {to to to> to}* to is 1 -equivalent to <74. Hence, {/ 3 = {q^ to}- on ^Y 
element remaining in Q 2 ° is q 7 . Therefore, O4 = {^7}- Thus, 

*i = {{to}, {to> to to, to}, {to to}, {to}} 

0i 2 = {to} 


to is 2-equivalent to g 6 but not to q x or < 75 * So, 

O 2 2 = {to, to} 

As is 2-equivalent to < 75 , 

O 3 2 = {to* to} 

As <72 is 2-equivalent to < 74 , 


Thus. 


Qi - {to, to}, Qs - {to} 

^2 = {{to}, {to, to}, {to to}, {to, to}, {to}} 


As r/o is 3-equivalent to cj^. 

As q\ is 3-equivalent to q 5 , 

As q 2 is 3-equivalent to < 74 , 

0 / 

Therefore, 


O1 3 = {to} 

O2 3 = {to, to} 


Qs = {< 7 b <75} 


{<?2> 94 h 2s ? = {^7} 


^3 - {{^3}, {to, to}, {to> to}, {to to}, {to}} 

As 7r 3 = tt 2 , 7T 2 gives us the equivalence classes, the minimum state automaton 
is 

M' = (Q\ {a, b}, S', <70, n 
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where 

Q' = {feL [90 9&L lq\, 9 =]- [<?2, c ul [<??]} 

9o = [<?o. ^63. T' = tel 
and 5' is defined by Table 3.24. 


TABLE 3,24 Transition Table of Minimum State 
Automaton for Example 3.14 


Statell 

a 

b 

[9o. 9s] 

[ 91 . 95 ] 

[9o, 9sl 

[9i. 95] 

[9o. 9sl 

[92. 94] 

[92, 9d 

[93] 

[9i. 9s] 

[ 93 ] 

[ 93 ] 

[9o. 9el 

[ 97 ] 

{9o. 9s] 

[ 93 ] 


Note: The transition diagram for M' is given by Fig, 3.15. 


b 



Fig. 3.15 Minimum state automaton of Example 3.14. 

3.10 SUPPLEMENTARY EXAMPLES 


EXAMPLE 3.15 


Construct a DFA equivalent to the NDFA M whose transition diagram is given 
by Fig. 3.16. 



Fig. 3.16 NDFA of Example 3.15 
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Solution 


The transition table of M is 

given by Table 3.25. 


TABLE 3.25 

Transition Table for Example 3.15 

State 

a 

b 

~^Qq 

Qo 

Qo, Q'2 

Qi 

Q 3 

Q2, Q3 

0,2 

Q 4 

— 

® 

Q4 

— 

For the equivalent DFA: 



(i) The states are subsets of Q = {q ih q 2 , # 3 , c/ 4 }, 

(ii) [</ 0 ] is the initial state. 


(iii) The subsets of Q containing q 3 or <y 4 are the final states. 

(iv) <5 is defined by Table 3.26. We start from. [//,,] and construct S. only 

for those states reachable from [q 0 ] (as in Example 3.8). 

TABLE 3.26 

Transition Table of DFA for 
Example 3.15 


State 

a 

b 

tool 

Idol 

too. ft] 

too . ft] 

too. <? 4 l 

too- ft] 

too, < 74 ] 

tool 

too, ft] 




Construct a DFA equivalent to an NDFA whose transition table is defined by 

Table 3.27. 



TABLE 3.27 

Transition Table of NDFA for 


Example 3.16 


State 

a 

b 

Qo 

Qi, Qs 

Q2, Q 3 

Qi 

Qi 

Qs 

Q2 

© 

Qs 

Q2 


Solution 

Let M be the DFA defined by 

M = (2^0- «>- ft- ftl, {a, b }, & [cy 0 ], F) 
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where F is the set of all subsets of {q {] . q v q 2 . q^} containing q y 8 is defined 
by Table 3.28. 


TABLE 3.28 Transition Tabie of DFA for 
Example 3.16 


State 

a 

b 

faol 

fai. <73] 

[<? 2 > 93] 

fei. 9 zi 

fail 

fas] 

fail 

fail 

[93] 

fas] 

0 

0 

faz. <?3l 

fas] 

fa 2 ] 

[<fe] 

fas! 

fa 2 ] 

0 

0 

0 



Construct a DFA accepting all strings w over {0, 1} such that the number of 
l ? s in \v is 3 mod 4. 


Solution 

Let M be the required NDFA. As the condition on strings of T(M) does not 
at all involve 0. we can assume that M does not change state on input 0. If 1 
appears in vv (4k + 3) times. M can come back to the initial state, after reading 
4 Vs and to a final state after reading 3 Ls. 

The required DFA is given by Fig. 3.17. 



EXAMPLE 3.18 


Construct a DFA accepting all strings over {a. b) ending in ah. 

Solution 

We require two transitions for accepting the string ab. If the symbol b is 
processed after aa or ba. then also we end in ab. So we can have states for 
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remembering aa, ab , ba , bb. The state corresponding to afo can be the final 
state in our DFA. Keeping these in mind we construct the required DFA. Its 
transition diagram is described by Fig. 3.18. 



EXAMPLE 3.19 


Find 7TM) for the DFA M described by Fig. 3.19. 

b a 



Fig. 3.19 DFA of Example 3.19. 


Solution 

T{M) = {w e {cl b}* | w ends in the substring ab} 

Note: If we apply the minimization algorithm to the DFA in Example 3.18, 
we get the DFA as in Example 3.19. (The student is advised to check.) 


EXAMPLE 3.20 


Construct a minimum state automaton equivalent to an automaton whose 
transition table is defined by Table 3.29. 
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TABLE 3.29 DFA of Example 3.20 


State 

a 

b 

~^Qq 

4i 

42 

4i 

4i 

43 

42 

43 

44 

43 

4i 

45 

44 

44 

42 

® 

46 

46 


Solution 

Q] - 02 “ 4b C J 2 , 43> ^ 4 }' S°? *b ” {{^ 75 }^ {<7o^ 4b 42' 43^ 44}} 

Qj° cannot be partitioned further. So {£ 75 } € 7 ^. Consider Q 2 . 4o equivalent 
to <gq, q 2 and q± But cj 0 is not equivalent to q 2 since S(g 0 , b) = q 2 and 
5(43, 5) = {45}* 

Hence, 

Ci 1 = { 95 }. G : 1 = bo. </i- 92 . <?4}< G3 1 = {<7?> 

Therefore, 

*1 = {{95}- bo. 91 . 92 . 94}. {93}} 

< 7 0 is 2-equivalent to q A but not 2-equivalent to q x or q 2 . 

Hence, 

(4o> 44/ e K z 

q x and q 2 are not 2-equivalent. 

Therefore. 

^2 = {{95}- {93}. {90.94}. {91}. {92}} 

As q 0 is not 3-equivalent to q>, bo. q 4 ] is further partitioned into {^j} and 

{94}- 

So, 

X 3 = {{9o}. {9il. (9:}. { 93 }. { 94 }. {9.4} 

Hence the minimum state automaton M' is the same as the given M. 


EXAMPLE 3.21 


Construct a minimum state automaton equivalent to a DFA whose transition 
table is defined by Table 3.30. 

TABLE 3.30 DFA of Example 3.21 


State a b 


—>4o 

4i 

42 

4i 

44 

43 

42 

44 

43 

© 

4s 

46 

© 

47 

46 

45 

4s 

46 

4s 

4s 

46 

4? 

44 

46 
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Solution 

Q\ = iq?~ Q ( i = {%■ <?h ( h- (is, qi) 

n 0 = U<?3> 94}- {<?o- 9i- qi , 95’ 96’ 97}} 

is 1-equivalent to < 74 . So, {<y 3 . g 4 } e Tq. 

< 7 0 is not 1 -equivalent to q u q 2 , q 5 but q 0 is 1 -equivalent to q 6 . 

Hence {q& q $} € 7T { . q x is 1 -equivalent to q 2 but not 1-equivalent to 
or q r So, {q b <y 2 } 6 7l v 

q 5 is not 1 -equivalent to q 6 but to <y 7 . So, {^/ 5 , ty 7 } € 7q 
Hence, 

ff] = {{93- 94}- (90’ 96}’ (9i’ 9;}’ {95- 9?}} 
ty 3 is 2-equivalent to <y 4 . So, {<y 3 , <y 4 } 6 /r 2 . 

q$ is not 2-equivalent to q$. So, {<y 0 }. { q 5} € 7T 2 . 

tyj is 2-equivalent to cy 2 . So* {g b <y 2 } e 7T 2 . 

<y 5 is 2-equivalent to g 7 . So, {g 5 , qi} e K 2 . 

Hence* 

*2 = {{93’ 94}’ {90}’ {96}’ {9i’ 9:}’ {95’ 97}} 

<y 3 is 3 -equivalent to g 4 ; q { is 3 -equivalent to g 2 and q 5 is 3 -equivalenr to ry 7 . 
Hence* 

^3 = {{90}’ { q \’ 92}’ {93- 94}- {95’ 97}. {96}} 

As 7 T 3 = /r 2 , the minimum state automaton is 

M' = (Q' {a* b], 5' {[?3. 9J}) 

where S' is defined by Table 3.31* 


TABLE 3.31 Transition Table of DFA for 
Example 3.21 


State 

a 

h 

tool 

[9i. 92] 

[9i. 9 2 ] 

[Qi. <fe] 

[93 . 94! 

[93. 94] 

[QZ: 94] 

[95 , 97] 

[96] 

[<7s, 9?] 

[9s. 94] 

[96 ] 

[96] 

[9e] 

[9eJ 
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SELF-TEST 

Study the automaton given in Fig. 3.20 and choose the correct answers 
to Questions 1-5: 



Fig. 3.20 Automaton for Questions 1-5. 


1. M is a 

(a) nondeterministic automaton 

(b) deterministic automaton accepting {0. 1}* 

(c) deterministic automaton accepting all strings over {0. 1} having 
3 m 0 ? s and 3 n 1/s. m, n > 1 

(d) deterministic automaton 

2. M accepts 

(a) OHIO (b) 10001 (c) 01010 (d) 11111 

3. T(M ) is equal to 

(a) {0 : ' m l 3n | m<n> 0} 

(b) {0 3m l 3 w |m. n > 1} 

(c) {w |vv has 111 as a substring} 

(d) {vr|u’ has 3 n l’s. n > 1} 

4. If qi is also made a final state, then M accepts 

(a) OHIO and 01100 

(b) 10001 and 10000 

(c) 0110 but not 0111101 

(d) OH n > 1 but not 1H n > 1 

5. If q 2 is also made a final state, then T(M) is equal to 

(a) {0 3w l 3n | m 9 n > 0} u {0 2 ” 1 l n | m. n > 0} 

(b) {0 3m V n | »h n > 1} u {0 Zm l n | m, n > 1} 

(c) {vv | >v has 111 as a substring or 11 as a substring} 

(d) {w | the number of l’s in w is divisible by 2 or 3} 

Study the automaton given in Fig. 3.21 and state whether the Statements 
6-15 are true or false: 



Fig. 3.21 Automaton for Statements 6-15. 
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6 . M is a nondeterministic automaton. 

7. 8 (a h 1) is defined. 

8. 0100111 is accepted by M. 

9. 010101010 is not accepted by M . 

10. 8 (q 0 . 01001) = {q x }. 

11 . 8 (q 0 , 011000) = {q 0 , Qu Qi)- 

12. 8 (q 2 , w) = q 2 for any string w e {0. 1}*. 

13. %i, 11001) * 0. 

14. T(M) = {w \ w = jc00y, where x, y e {0, 1}*}. 

15. A string having an even number of 0’s is accepted by M. 

EXERCISES 

3.1 For the finite state machine M described by Table 3.1, find the strings 
among the following strings which are accepted by M: (a) 101101, 
(b) 11111, (c) 000000. 

3.2 For the transition system M described by Fig. 3.8, obtain the sequence 
of states for the input sequence 000101. Also, find an input sequence not 
accepted by M. 

3.3 Test whether 110011 and 110110 are accepted by the transition system 
described by Fig. 3.6. 

3.4 Let M - (Q, X. <5, q 0 , F) be a finite automaton. Let R be a relation in 
Q defined by q\Rq 2 if 8 (q\, a) = 8 (q 2 , a) for all a e X. Is R an 
equivalence relation? 

3.5 Construct a nondeterministic finite automaton accepting {ab, ba }, and 
use it to find a deterministic automaton accepting the same set. 

3.6 Construct a nondeterministic finite automaton accepting the set of all 
strings over {a, b) ending in aba. Use it to construct a DFA accepting the 
same set of strings. 

3.7 The transition table of a nondeterministic finite automaton M is defined 
by Table 3.32. Construct a deterministic finite automaton equivalent to M. 

TABLE 3.32 Transition Table for Exercise 3.7 
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3*8 Construct a DFA equivalent to the NDFA described by Fig. 3.8. 

3*9 M = ({<?]. q 2 , q 2 ), {0, 1}, 8, q 1? {q 2 }) is a nondeterministic finite 
automaton, where 8 is given by 

8(q x . 0) = {q 2 , q,}, 1) = fo} 

< 5 ( 9 3 0) = {9,, 172}' <S(< 7 ;, 1 ) = 0 

<5(9?, 0 ) = { 9 ?}, < 5 ( 93 . 1 ) = (9i. 92} 

Construct an equivalent DFA. 

3*10 Construct a transition system which can accept strings over the alphabet 
a y b f ... containing either cat or rat, 

3*11 Construct a Mealy machine which is equivalent to the Moore machine 
defined by Table 3.33. 

TABLE 3.33 Moore Machine of Exercise 3.11 


Present state 

Next 

state 

Output 


o 

II 

CD 

a = 1 


— ><7o 

9i 

92 

1 


93 

92 

0 

92 

92 

9i 

1 

93 

9o 

93 

1 


3*12 Construct a Moore machine equivalent to the Mealy machine M defined 
by Table 3.34. 


TABLE 3.34 Mealy Machine of Exercise 3.12 


Present state 


Next 

state 


a = 0 


a 

= 1 

state 

output 

state 

output 

->9i 

9i 

1 

92 

0 

92 

94 

1 

94 

1 

93 

92 

1 

93 

1 

94 

93 

0 

9i 

1 


3.13 Construct a Mealy machine which can output EVEN, ODD according 
as the total number of Ts encountered is even or odd. The input 
symbols are 0 and 1. 

34 4 Construct a minimum state automaton equivalent to a given automaton 
M whose transition table is defined by Table 3.35. 




106 B Theory of Computer Science 


TABLE 3.35 Finite Automaton of Exercise 3.14 


State 

Input 

a b 

— > Qo 

9o 

9s 

9 i 

92 

95 

92 

93 

94 

93 

9o 

95 

94 

9o 

96 

95 

9i 

94 

® 

9i 

9s 


3.15 Construct a minimum state automaton equivalent to the DFA described 
by Fig. 3.18. Compare it with the DFA described by Fig. 3.19. 

3.16 Construct a minimum state automaton equivalent to the DFA described 
by Fig. 3.22. 




Formal Languages 



In this chapter we introduce the concepts of grammars and formal languages 
and discuss the Chomsky classification of languages. We also study the 
inclusion relation between the four classes of languages. Finally, we discuss the 
closure properties of these classes under the various operations. 


4.1 BASIC DEFINITIONS AND EXAMPLES 

The theory of formal languages is an area with a number of applications in 
computer science. Linguists were trying in the early 1950s to define precisely 
valid sentences and give structural descriptions of sentences. They wanted to 
define a formal grammar (i.e. to describe the rules of grammar in a rigorous 
mathematical way) to describe English. They thought that such a description of 
natural languages (the languages that we use in everyday life such as English, 
Hindi. French, etc.) would make language translation using computers easy. It 
was Noam Chomsky who gave a mathematical model of a grammar in 1956. 
Although it was not useful for describing natural languages such as English, it 
turned out to be useful for computer languages. In fact, the Backus-Naur form 
used to describe ALGOL followed the definition of grammar (a context-free 
grammar) given by Chomsky. 

Before giving the definition of grammar, we shall study, for the sake of 
simplicity, two types of sentences in English with a view to formalising the 
construction of these sentences. The sentences we consider are those with a 
noun and a verb, or those with a noun-verb and adverb (such as ‘Ram ate 
quickly' or ’Sam ran'). The sentence 'Ram ate quickly' has the words ‘Ram’, 
'ate', 'quickly’ written in that order. If we replace ‘Ram’ by ‘Sam’. 'Tom’, 
‘Gita’, etc. i.e. by any noun, 'ate’ by ‘ran*, 'walked', etc. i.e, by any verb in the 
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past tense, and ’quickly’ by ‘slowly\ ue. by any adverb, we get other 
grammatically correct sentences. So the structure of ’Ram ate quickly' can be 
given as (noun) (verb) (adverb). For (noun) we can substitute ‘Ram’, ‘Sam’. 
’Tom'. ‘Gitak etc. . Similarly, we can substitute ’ate*, ‘walked’, ‘rank etc. for 
(verb), and ‘quickly\ ‘slowly’ for (adverb). Similarly, the structure of ‘Sam ran’ 
can be given in the form (noun) (verb). 

We have to note that (noun) (verb) (adverb) is not a sentence but only the 
description of a particular type of sentence. If we replace (noun), (verb) and 
(adverb) by suitable words, we get actual grammatically correct sentences. Let 
us call (noun), (verb), (adverb) as variables. Words like ‘Ram’, ‘Sam’, fi ate\ 
‘rank ’quicklyk ‘slowly’ which form sentences can he called terminals. So our 
sentences turn out to be strings of terminals. Let S be a variable denoting a 
sentence. Now. we can form the following rules to generate two types of 
sentences: 

S —> (noun) (verb) (adverb) 

S (noun) (verb) 

(noun) —» Sam 
(noun) Ram 
(noun) —» Gita 
(verb) —> ran 
(verb) —? ate 
(verb) —» walked 
(adverb) -o slowly 
(adverb) —> quickly 

(Each arrow represents a rule meaning that the word on the right side of the 
aiTow can replace the word on the left side of the arrow.) Let us denote the 
collection of the rules given above by P. 

If our vocabulary is thus restricted to ‘Ram’, ‘Sam’, ‘Gita’, ‘atek ‘ran’, 
‘walked*, ’quickly* and ‘slowly*, and our sentences are of the form (noun) 
(verb) (adverb) and (noun) (verb), we can describe the grammar by a 4-tuple 
(V N , Z, P, 5). where 

V.y = {(noun), (verb), (adverb)} 

Z = {Ram, Sam, Gita. ate. ran. walked, quickly, slowly} 

P is the collection of rules described above (the rules may be called 
productions), 

S is the special symbol denoting a sentence. 

The sentences are obtained by (i) starting with S, (ii) replacing words 
using the productions, and (iii) terminating when a string of terminals is 
obtained. 

With this background we can give the definition of a grammar. As 
mentioned earlier, this definition is due to Noam Chomsky. 
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4.1.1 Definition of a Grammar 

Definition 4.1 A phrase-structure grammar (or simply a grammar) is 
(Tv, Z, P, S)s where 

(i) Tv is a finite nonempty set whose elements are called variables, 

(ii) Z is a finite nonempty set whose elements are called terminals, 

(iii) Tv n Z - 0. 

(iv) 5 is a special variable (i.e. an element of V( v ) called the start symbol, 
and 

(.v) P is a finite set whose elements are a —> / 3 . where a and are strings 
on Tv w- Z. a has at least one symbol from Tv- The elements of P are 
called productions or production rules or rewriting rules. 

Note: The set of productions is the kernel of grammars and language 
specification. We observe the following regarding the production rules. 

(i) Reverse substitution is not permitted. For example, if S -4 AB is a 
production, then we can replace S by AB, but we cannot replace AB 
by 5 . 

(ii) No inversion operation is permitted. For example, if S -4 AB is a 
production, it is not necessary that AB 4 5 is a production. 


EXAMPLE 4.1 


G = (Tv P> S) is a grammar 

where 

Tv= {(sentence), (noun), (verb), (adverb)} 

Z = [Ram, Sam. ate. sang, well] 

S = (sentence) 

P consists of the following productions: 

(sentence) -4 (noun) (verb) 

(sentence) -4 (noun) (verb) (adverb) 

(noun) -4 Ram 
(noun) -4 Sam 
(verb) -4 ate 
(verb) —> sang 
(adverb) -4 well 

Notation: (i) If A is any set. then A* denotes the set of all strings over A. 
A + denotes A* - {A}, where A is the empty string. 

(ii) A, B, C, A], A 2 . ... denote the variables. 

(in ) cl b . c\ . . . denote the terminals. 

(iv) x v. u\ . . . denote the strings of terminals. 

(v) a, /?, y ... denote the elements of (Tv u £)*• 

(vi) X° = A for any symbol X in Tv ! w Z. 
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4.1.2 Derivations and the Language Generated 
by a Grammar 

Productions are used to derive one string over V N u Z from another string. 
We give a formal definition of derivation as follows: 

Definition 4.2 If a —> (3 is a production in a grammar G and y, S are any 
two strings on V N u X, then we say that ya<5 directly derives y/38 in G (we 
write this as yah => y/3<5). This process is called one-step derivation. In 

G 

particular, if or —^ /3 is a production, then a => j3. 

G 

Note; If a is a part of a string and a -> j3 is a production, we can replace 
a by [3 in that string (without altering the remaining parts). In this case we 
say that the string we started with directly derives the new string. 

For example, 

G - ({5}, (0, 1}, {S 051, 5 01}, 5) 

has the production 5 —» 051. So, S in 0 4 51 4 can be replaced by OS I. The 
resulting string is 0 4 0S11 4 . Thus, we have 0 4 S1 4 =>Q 4 05I1 4 . 

Note: =^> induces a relation R on (V v u X)*, i.e. aR/3 if a => /1 

G G 

Definition 4.3 If a and /? are strings on V' v u X, then we say that a derives 

j3 if a => j3. Here => denotes the reflexive-transitive closure of the relation 

g g G 

in (V'v u X)* (refer to Section 2.1.5). 

Note: We can note in particular that a => a. Also, if a [3, a ^ j3, then 

G G 

there exist strings a b a 2 . ..a 72 , where n > 2 such that 
a = => a 2 r=> 0C3 . •. => = /3 

G G G 

When a => (3 is in n steps, we write a => B. 

G G 

Consider, for example, G = ({5}, {0. I}, {5 —» 051, 5 —» 01}, 5). 

As S => 051 => 0 2 S1 2 => 0 3 S1 3 , we have S => 0 3 S1 3 . We also have 
GGg g 

0 3 51 3 => 0 3 51 3 (as a 4 a). 

G q 

Definition 4.4 The language generated by a grammar G (denoted by L(G)) is 
defined as {w e X* | 5 ^ h }. The elements of L(G) are called sentences . 

G 

Stated in another way, L(G) is the set of all terminal strings derived from 
the start symbol 5. 

Definition 4.5 If S a, then a is called a sentential form. We can note 
that the elements of L(G) are sentential forms but not vice versa. 
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Definition 4.6 G, and G 2 are equivalent if L(G{) ~ L(G : ). 

Remarks on Derivation 

1. Any derivation involves the application of productions. When the 

number of times we apply productions is one, we write a => (3\ when 

* „ * G 

it is more than one, we write a => p (Note: a ^ a). 

G G 

2. The string generated by the most recent application of production is 
called the working string, 

3. The derivation of a string is complete when the working string cannot 
be modified. If the final string does not contain any variable, it is a 
sentence in the language. If the final string contains a variable, it is a 
sentential form and in this case the production generator gets ‘stuck’. 

Notation: (i) We write a p simply as a => /3 if G is clear from the context. 

G 

(ii) If A —» a is a production where A e K v , then it is called an 
A-production. 

(iii) If A —> a { .A —» a 2 , . . .. A a m are A-productions. these 
productions are written as A -a cq j a 2 j ... | <x nv 

W T e give several examples of grammars and languages generated by them. 


EXAMPLE 4.2 


If G = ({5}. {0. 1}. {5 -a 051, 5 -a A}. 5), find L(G), 

Solution 

As 5 -a A is a production, S => A. So A is in L(G). Also, for all n > 1, 

G 

S => 051 => 0 2 51 2 =>...=> 0”5r ! => 0 7 T" 

C G G G G 

Therefore, 

0 7 T 7 6 1(G) for ft > 0 

(Note that in the above derivation, S —» 051 is applied at every r step except 
in the last step. In the last step, we apply S —> A). Hence. {0 7i l" | n > 0} c LXG). 

To show that L(G) c {O”! 77 ] n > 0}, we start with w in L(G). The 
derivation of vv starts with S. If S —> A is applied first, we get A. In this case 
vr = A. Otherwise the first production to be applied is 5 -a 051. At any stage 
if we apply 5 -a A, we get a terminal string. Also, the terminal string is 
obtained only by applying 5 -a A. Thus the derivation of w is of the form 

5 => o n Sl u => 0'T 1 for some n > I 
g g 


i.e. 


L(G) c {0 77 r|ft > 0} 
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Therefore, 


L(G) - {0"l n |n > 0} 



If G = ({5}, {a}, {S —» 55}, 5), find the language generated by G . 

Solution 

L(G) - 0. since the only production 5 —> 55 in G has no terminal on the 
right-hand side. 


EXAMPLE 4.4 


Let G = ({5, C}, {<3, £>}, 5), where P consists of 5 aCa , C c/Ca | Z?. 

Find L(G). 

Solution 


5=^ aCa aba. So #Z?a € L(G) 

5 => aCa (by application of 5 —» aCa) 

~> (by application of C —> aCa ( n - 1) times) 

=> ci n ba n (by application of C h) 

Hence. a u ba n e L(G ), where n > 1. Therefore. 

{a n hcf | n > 1} c L(G) 

As the only 5-production is 5 —> aCa, this is the first production we have 
to apply in the derivation of any terminal string. If we apply C —> b. we get aba. 
Otherwise we have to apply only C —» aCa, either once or several times. So 
we get a lt Ca n with a single variable C To get a terminal string we have to 
replace C by b y by applying C —» b. So any derivation is of the form 

5 ci n ba n with n > 1 


Therefore, 

Thus. 


L(G) c { a !l ba n \ n > 1} 
L(G) = {a n ba n | n > 1} 


EXERCISE Construct a grammar G so that L(G ) = { d l ba m | m > 1}. 


Remark By applying the convention regarding the notation of variables, 
terminals and the start symbol, it will be clear from the context whether a 
symbol denotes a variable or terminal. We can specify a grammar by its 
productions alone. 
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EXAMPLE 4.5 


If G is 5 —> aS | bS \ a \ b, find L(G). 

Solution 

We show that L(G ) = {cl Z?} \ As we have only two terminals a, b, 
L(G) c {cl b}*. All productions are 5-productions, and so A can be in L(G) 
only when 5 —> A is a production in the grammar G. Thus. 

LAG) c {a b}* - {A} = {a f b}+ 

To show {cl b} + c L(G), consider any string a { a 2 . . . where each a } 
is either a or b. The first production in the derivation of a { ci 2 . *. ci n is 5 —> 
aS or 5 —> hS according as a\ = a or ci\ = b. The subsequent productions are 
obtained in a similar way. The last production is S a or S —» h according 
as a n ~ a or a n = b. So ci\ci 2 . . . a n s L(G). Thus, we have L(G) - {a, b} + . 

EXERCISE If G is 5 -4 aS | cl then show that L(G) = { a } + . 

Some of the following examples illustrate the method of constructing a 
grammar G generating a given subset of strings over L The difficult part is the 
construction of productions. We try to define the given set by recursion and then 
develop productions generating the strings in the given subset of L*. 


EXAMPLE 4.6 


Let L be the set of all palindromes over {a , b}. Construct a grammar G 
generating L 

Solution 

For constructing a grammar G generating the set of all palindromes, we use 
the recursive definition (given in Section 2.4) to observe the following: 

(i) A is a palindrome. 

(ii) a . b are palindromes. 

(iii) If ,r is a palindrome qxcl then bxb are palindromes. 

So we define P as the set consisting of: 

ii) 5 A 

(ii) 5 a and 5 —> b 

(iii) 5 —^ ciSci and 5 —^ bSb 

Let G = ({SI {a, b} t P, S). Then 

5 => A, 5 =» a, S => h 

The, Tore. 

A. cl b e L(G) 

If .v is a palindrome of even length, then a* = a^a 2 . . . a m a m . . . ci u where 
each ci; is either a or b. Then 5 => aui 2 . . a }V; a m a m _\ ... cl. by applying 
S aSa or 5 bSb. Thus, x e L(G). 
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If x is a palindrome of odd length, then x = a ^2 . . . , , . a L . where 

ays and c are either a or b. So 5 => a\ . . . tf /7 5<x, . . . <7] * by applying 

5 -» aSa, 5 hSh and finally, S —» a or S —> b. Thus, x e L(G). This proves 
L = L(G>. 


EXAMPLE 4.7 


Construct a grammar generating L = {wcw r ] w e {a, &}*}. 

Solution 

Let G = ({5}, {a, b, c), P, S), where P is defined as 5 -> aSa | bSb \ c. It 
is easy to see the idea behind the construction. Any string in L is generated 
by recursion as follows; (i) c e L: (ii) if .v e L. then wxw 1 £ L. So, as in 
the earlier example, we have the productions S —» aSa \ bSb | c\ 


EXAMPLE 4.8 


Find a grammar generating L = {a"b"c‘ \ n >. 1. i > 0}. 

Solution 

L = Li u L 2 
Li = { a"b” | n > 1} 
l 2 = {a"b”c‘ | n > 1. i > 1} 

We construct L[ by recursion and L 2 by concatenating the elements of L { 
and c\ / > 1. We define P as the set of the following productions: 

S —» A, A —> ab, A —» aAb, S —» Sc 

Let G = ({S, A}, {a, b, c}, P. S). For n > L i —> 0. we have 

5 Sc 1 => Ac 1 => a’^Ab’^c' => a"- 1 abb"~ ] c = a"b”c i 

Thus, 

{a n b n c j | n > 1. £ > 0} c L(G) 

To prove the reverse inclusion, we note that the only 5-productions 
are 5 —» Sc and 5 —» A. If we start with 5 —> A, we have to apply 

A => d 7 ~ x Ab n ~ { => c£ 7 b n * and so a }l b 71 c° € L(G) 

If we start with 5 —> 5c, we have to apply S Sc repeatedly to get Sc 1 . But 
to get a terminal string, we have to apply 5 —> A. As A a n b n , the resulting 
terminal string is a n b u c l . Thus, we have shown that 

L(G) c {aWc* \ n > 1. / > 0} 

Therefore. 

1(G) = {a n b ,l c l \n > L i > 0} 
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EXAMPLE 4.9 


Find a grammar generating {a j b u c }1 \ n > 1, j > 0}. 

Solution 

Let G = ({5, A}, {a, Z?, c}, P, 5), where P consists of S -4 5 -4 A, 

A -4 Me | Pc. As in the previous example, we can prove that G is the required 
grammar. 


EXAMPLE 4.10 


Let G = ({5, Ay}, {0. L 2}. P, S ), where P consists of S -4 0SA]2, 5 -4 012. 
2A] -4 A } 2, lAj -4 11. Show that 

L(G) = {0"i n 2"|/i > 1} 

Solution 

As S -4 012 is a production, we have S => 012, i.e, 012 e L(G). 


Also. 



5 =?> 

by applying S —» 05A]2 

(n - 1) times 

=> 0” 12(A 1 2)" _1 

by applying 5 -4 012 


i OnAl’-'T 

by applying 2A] -4 A.2 

several times 


by applying 1A-. -4 11 

(n - 1) times 

Therefore. 




€ L(G) for all n > 1 



To prove that L(G) c {0 /I l"2 H | n > 1}. we proceed as follows: If the first 
production that we apply is S —> 012. we get 012. Otherwise we have to apply 
S -4 0SAj2 once or several times to get 0"“” 1 .S(Ai2)"“ l . To eliminate S , we have 
to apply 5 -4 012. Thus we arrive at a sentential form 0 n l2(A[2) n '\ To 
eliminate the variable A lt we have to apply 2Aj -4 A{2 or lAj -4 11. Now, 
2A| —> Ayl interchanges 2 and Aj. Only 1Aj -4 11 eliminates A]. The sentential 
form we have obtained is 0 7 T2A } 2A]2 . . . Aj2. If we use 1A i -4 11 before 
taking all 2’s to the right, we will get 12 in the middle of the string. The A i , s 
appearing subsequently cannot be eliminated. So we have to bring all 2’s to the 
right by applying 24] -4 Aj2 several times. Then we can apply 1A { -4 11 
repeatedly and get 0 /? 1" 2 n (as derived in the first part of the proof). Thus. 

L(G) c {0 n l n 2 n \ n > 1} 

This shows that 

L(G) = {0 n r2 n \ n > 1} 

In the next example we construct a grammar generating 

{ a n b n c n | n > 1} 
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EXAMPLE 4.11 


Construct a grammar G generating {a n b n c ll \ n >1}. 

Solution 

Let L - {a n b n c n | n >1}. We try 7 to constmct L by recursion. We already know 
how to construct a n b n recursively. 

As it is difficult to construct a n b n c n recursively, we do it in two stages: 
(i) we construct d l d l and (ii) we convert d l into b n d\ For stage (i), we can have 
the following productions S aSa | aa. A natural choice for a (to execute 
stage (ii)) is be. But converting ( bc) n into b n c 11 is not possible as (bef has no 
variables. So we can take a - BC, where B and C are variables. To bring 
B 's together we introduce CB —» BC. We introduce some more productions 
to convert B’s into d s and C s into c's. So we define G as 


G = ({S, B, C}, {a, b, c}, P, S ) 

where P consists of 


S —^ aSBC j aBC, CB —^ BC, ciB —^ ab, bB —} bb, bC —^ be, cC —^ cc 

S => aBC => abC abc 


Thus. 

Also, 

S =i> a"~ l S(BC)" M 
=> a"~ l aBC(BQ n ~' 
a n B" C” 

=> a n ~ l abB"- ] C" 

=> a ,! b"C" 

=> a"b"~ l bcC"~ l 
=> d'b n c" 


abc e L(G) 


by applying 5 —» aSBc 

by applying S —» aBC 

by applying CB —» BC 
(since CB —» BC 
interchanges B and C) 

by applying cB —■> afc 

by applying bB —> Bfc 

by applying bC —> fee 

by applying cC —> cc 


{n - 1) times 

several times 


once 

several times 
once 

several times 


Therefore, 


L(G) c {a n b ,l c”\n > 1} 

To show that {a n b n c n \ n > 1} c L(G), it is enough to prove that the only 
way to arrive at a terminal string is to proceed as above in deriving a n b n c n 
(n > 1). 

To start with, we have to apply only S-production, If we apply S aBC ' 
first we get abc. Otherwise we have to apply S —> aSBC once or several times 
and get the sentential form a n ~ ] S(BQ u ~\ At this stage the only production we 
can apply is S —> aBC, and the resulting string is a n (BC) n . 
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In the derivation of d l b n c n , we converted all ZTs into V s and only then 
converted Cs into c s. We show that this is the only way of arriving at a 
terminal string. 

a‘\BC) n is a string of terminals followed by a string of variables. The 
productions w T e can apply to a n (BC) n are either CB BC or one of aB —> ah, 
bB —> bb, bC —» be , cC —> cc. By the application of any one of these 
productions, the we get a sentential form which is a string of terminals followed 
by a string of variables. Suppose a C is converted before converting all ZTs. 
Then we have a n (BQ n a n b ! ca ; where / < n and a is a string of B’s and Cs 
containing at least one 5. In o n b l ca , the variables appear only in a As c appears 
just before a, the only production we can apply is cC —> cc, If a starts with 
B, we cannot proceed. Otherwise we apply cC —> cc repeatedly until we obtain 
the string of the form d l b l c J Ba\ But the only productions involving B are 
aB ah and bB bb. As B is preceded by c in d l b?bBa\ we cannot convert 
B } and so we cannot get a terminal string. So L(G) c {a n b n c n j /z > 1}. Thus, 
we have proved that 

L(G) = {a n b n c u | n > 1} 


EXAMPLE 4.12 


Construct a grammar G generating {xx\x € {a, &}*}. 


Solution 

We construct G as follows: 


G = ({S, 5 b Sy 5 3 . A, B }, {a, b}, P, S) 


where P consists 

of 


Pi 

: 5 —> 

5i5 2 5; 

p z , p 3 

: S X S 2 

aS\A, 

Pa, p f 

: A5 3 -> 

S 2 aS 3 , 

p-, p 8 . p 9 

: Aa —» 

aA, 

Pio, P 11 

: <xS 2 

S 2 a, 

Pv, Pn 

: ^ 1^2 

A. 


S X S 2 -> bS\B 
BS 3 -4 S 2 bS 3 

Ab —> bA, Ba —» aB, Bb —> bB 

bS 2 -» S 2 b 
S 3 -» A 


Remarks The following remarks give us an idea about the construction of 
productions P\-P 13 . 

1. Pi is the only 5-production. 

2. Using S)S 2 —> < 75 [A, we can add terminal a to the left of 5j and variable 
A to the right. A is used to make us remember that we have added the 
terminal a to the left of S b Using AS : , S 2 aSy we add a to the right 
of S 2 . 
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3. Using S X S 2 —» bSiB, we add b to the left of 5) and variable B to the 
right. Using BS 3 —» 5 2 &S 3 , we add £> to the right of S 2 . 

4. S 2 acts as a centre-marker. 

5. We can add terminals only by using P 2 -P 5 . 

6 . Pfi—Pg simply interchange symbols. They push A or B to the right. This 
enables us to place A or B to the left of S 3 . (Only then we can apply 
P 4 or P 5 .) 

7. SiS 2 — > A, S 3 — > A are used to completely eliminate S 1? Sy S 3 . 

8 . P 10 , P\\ are used to push S 2 to the left. This enables us to get S 2 to 

the right of Sj (so that we can apply P 12 ). 

Let L - {xx | .x e {a, £}*}. We first prove that L c L(G). Now, we have 

S => S { S 2 S 3 => aS { AS 3 => aS&aSi (4.1) 

or 

S => S { S 2 S 3 => bSiBS 3 => bS ] S 2 bS 3 (4.2) 

Let us start with xx with x e {ab}*. We can apply (4.1) or (4.2), 
depending on the first symbol of x. If the first two symbols in x are ab (the 
other cases are similar), we have 

S —> aSiS 2 aS 3 abS\ BaS 3 => abS\aBS 3 abSi aS 2 bS 3 => abSiS 2 abS 3 

Repeating the construction for every 7 symbol in x, we get xS i S 2 xS 3 . On 
application of P 12 and P 13 , we get 

S => xS i S 2 xS 3 => xAxA = xx 

Thus, L c L(G). 

To prove that L(G) c L, we note that the first three steps in any derivation 
of L(G) are given by (4.1) or (4.2). Thus in any derivation (except S —> A), we 
get aS]S 2 aS 3 or bS\S 2 bS 3 as a sentential form. 

We can discuss the possible ways of reducing aS\S 2 aS 3 (the other case is 
similar) to a terminal string. The first production that we can apply to aSiS 2 aS 3 
is one of S[S 2 —^ A, S 2 —^ A, S{S 2 —^ aS\A, S\S 2 —^ bS\B. 

Case 1 We apply S\S 2 —> A to aSiS 2 aS 3 . In this case we get aAaSy As the 
productions involving S 3 on the left are P 4 , P 5 or P 13 , we have to apply only 
S 3 —> A to aaS 3 and get aa <= L. 

Case 2 We apply S 3 —» A to aS\S 2 aSy In this case we get aSiS 2 aA. If we 
apply S\S 2 —> A, we get aAaA = aa e L; or we can apply SiS 2 —> aS\A to 
aS\S 2 a to get aaS\Aa. In the latter case, we can apply only Aa aA to aaS\Aa. 
The resulting string is aaS\aA which cannot be reduced further. 

From Cases 1 and 2 we see that either we have to apply both P 12 and P l3 
or neither of them. 

Case 3 In this case we apply SiS 2 —» aS { A or S { S 2 —> bS Y B. If we apply 
S]S 2 —> aS x A to aSyS 2 aSy we get aaS^AaSy By the nature of productions we 
have to follow only aaS\AaS 3 => aaS\aAS 3 => a~S { aS 2 aS 3 =» a"SiS 2 a~Sy If 
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we apply S X S 2 -» bS\B, we get abS x S 2 abS 2 . Thus the effect of applying 
SiS 2 aSjA is to add a to the left of S]S 2 and S 3 . 

If we apply S]S 2 —» A, S 3 —» A (By Cases 1 and 2 we have to apply both) 
we get abab e L Otherwise, by application of P 2 or Ps> we add the same 
terminal symbol to the left of SiS 2 and S 3 . The resulting stnng is of the form 
jcSiS-tJcSa. Ultimately, we have to apply Pp and Pp and get xAxA = xx € L. So 
L(G)~ c L. Hence, L(G) = L. 


EXAMPLE 4.13 


Let G = ({S, A b A 2 }, {a, Z?}, P, S), where P consists of 

S —^ i7A{A~>u, A 1 —^ Z?£zA]A 2 Z?, A 2 —^ A x ab, aA x —^ baa, bA 2 b ^ abab 
Test whether vc = baabbabaaabbaba 

is in L(G). 


Solution 

We have to start with an S-production. At every stage we apply a suitable 
production which is likely to derive w. In this example, we underline the 
substring to be replaced by the use of a production. 

S => aA x A 2 a 

=> baa A 0 a 


Therefore. 


=> baa Ai a&a 

=> baab aA x A 2 oaba 

=> baabbaa A 2 baba 

=> baabba aA x abbaba 
=> baabbabaaabbaba = w 

iv e L(G) 


EXAMPLE 4.14 


If the grammar G is given by the productions S —> aSa | Z?SZ? | aa | bb \ A, 
show that (i) L(G) has no strings of odd length, (ii) any string in L(G) is of 
length 2 n, n > 0 , and (iii) the number of strings of length 2 n is 2 U . 

Solution 

Or application of any production (except S —> A), a variable is replaced by 
two terminals and at the most one variable. So, every step in any derivation 
increases the number of terminals by 2 except that involving S —» A. Thus, 
we have proved (i) and (ii). 
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To prove (iii), consider any string w of length 2m Then it is of the form 
ci x a 2 . . . ci u a n . , . a x involving n ’parameters' a x , ci 2 , . . a n . Each a { can be 
either a or b. So the number of such strings is 2T This proves (iii). 

4.2 CHOMSKY CLASSIFICATION OF LANGUAGES 

In the definition of a grammar (V^ X, P } S), V N and X are the sets of symbols 
and S e Vy. So if we want to classify grammars, we have to do it only by 
considering the form of productions, Chomsky classified the grammars into 
four types in terms of productions (types 0-3). 

A type 0 grammar is any phrase structure grammar without any restrictions. 
(All the grammars we have considered are type 0 grammars.) 

To define the other types of grammars, we need a definition. 

In a production of the form 0Ay/-^ (pay* where A is a variable, 0 is called 
the left context, y/ the right context, and day/ the replacement string. 


EXAMPLE 4.15 


(a) In abAbcd abABbccl ab is the left context, bed is the right context. 
a = AB. 

(b) In AC —» A, A is the left context. A is the right context, a = A. The 
production simply erases C when the left context is A and the right 
context is A. 

(c) For C -» A, the left and right contexts are A. And a - A. The 
production simply erases C in any context. 

A production without any restrictions is called a type 0 production. 

A production of the form 0A y/ 0ay/ is called a type 1 production if 
a ^ A. In type 1 productions, erasing of A is not permitted. 


EXAMPLE 4.16 


(a) aAbcD a bcD bcD is a type 1 production where where a, bcD are the 
left context and right context, respectively. A is replaced by bcD ^ A. 

(b) AB —» A bBc is a type 1 production. The left context is A, the right 
context is A. 

(c) A —» cibA is a type 1 production. Here both the left and right contexts 
are A. 

Definition 4.7 A grammar is called type 1 or context-sensitive or context- 
dependent if all its productions are type 1 productions. The production 5 —> A 
is also allowed in a type 1 grammar, but in this case S does not appear on the 
right-hand side of any production. 

Definition 4.8 The language generated by a type 1 grammar is called a 
type 1 or context-sensitive language. 
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Note: In a context-sensitive grammar G, we allow 5 A for including A 
in L(G). Apart from S —> A, all the other productions do not decrease the 
length of the working string. 

A type 1 production (pA iff (pay/ does not increase the length of the 
working string. In other words. \ <pAy/\ < | (pay /1 as a * A. But if a -» jS 
is a production such that | a | < | p |. then it need not be a type 1 production. 
For example, BC —» C£ is not of type 1. We prove that such productions can 
be replaced by a set of type 1 productions (Theorem 4.2). 

Theorem 4.1 Let G be a type 0 grammar. Then we can find an equivalent 
grammar G\ in which each production is either of the form a —> /3, where a 
and P are strings of variables only, or of the form A —» a , where A is a variable 
and a is a terminal. G { is of type 1, type 2 or type 3 according as G is of type 
1, type 2 or type 3. 

Proof We construct G } as follows: For constructing productions of Gj, 
consider a production a —> ft in G, where a or P has some terminals. In both 
a and 8 we replace every terminal by a new variable Q and get a" and /T. 
Thus, corresponding to every a -4 ft, where a or ft contains some terminal, we 
construct a' —> ft' and productions of the form C a a for every terminal 
a appearing in a or p. The construction is performed for every such a p. The 
productions for Gi are the new productions we have obtained through the above 
construction. For G\ the variables are the variables of G together with the new 
variables (of the form C a ). The terminals and the start symbol of Gj are those 
of G. G 1 satisfies the required conditions and is equivalent to G. So L(G) = 
L(Gj). I 

Definition 4.9 A grammar G = (V.y, 2, P t S) is monotonic (or length- 
increasing) if every production in P is of the form a —> p with \a \ < | P j 
or 5 —> A. In the second case. S does not appear on the right-hand side of any 
production in P. 

Theorem 4.2 Every monotonic grammar G is equivalent to a type 1 grammar. 

Proof We apply Theorem 4.1 to get an equivalent grammar Gj. We construct 
G' equivalent to grammar G\ as follows: Consider a production A\A 2 ... A m —> 
BiB 2 ... B u with n > m in G { . If m = 1, then the above production is of 
type 1 (with left and right contexts being A). Suppose m > 2. Corresponding 
to A { A 2 . .. A m B]B 2 . .. B n , we construct the following type 1 productions 
introducing the new variables Cj. C 2 , .. C m . 

Aj A 2 . . . A m —^ Gj A 2 . . . A in 
C\ A-* • • • A m C { Ci_A 2 . . . A m 
C x C 2 A 3 . . . A m -» CiC.Q A 4 ' . . . A m . .. 

C\C 2 . . . C m _i A m QC 2 .. . . ... B n 
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Q C 2 . . . C m B m+i .. . B n —> B±C 2 . . . C m B m+] ... B n 

B[Ci_C 2 ... B n -> 5i^_C 3 ... 5*. . . 

B\B 2 . . • Q/^hi+i ’ ♦ • B n —> B { B 2 . . . ^ . . . B n 

The above construction can be explained as follows. The production 
A [A 2 . . . A m —> B\B 2 * ■ • B fl 

is not of type 1 as we replace more than one symbol on L.H.S. In the chain of 
productions we have constructed, we replace A { by C u A 2 by C 2 . . ., A m by 
C m B m+x . . . B n . Afterwards, we start replacing C { by B h C 2 by B 2 , etc. As we 
replace only one variable at a time, these productions are of type 1. 

We repeat the construction for every production in G { which is not of 
type 1. For the new grammar G\ the variables are the variables of G { together 
with the new' variables. The productions of G' are the new type 1 productions 
obtained through the above construction. The terminals and the start symbol 
of G' are those of G r 

G ' is context-sensitive and from the construction it is easy to see that 
L(G') = L(Gy) = L(G). I 

Definition 4.10 A type 2 production is a production of the form A —» a, 
where A e V' v and a e (T v '^ ! £)*• In other words, the L.H.S. has no left 
context or right context. For example, 5 —» Aa, A —» cl B —» abc\ A -» A are 
type 2 productions. 

Definition 4.11 A grammar is called a type 2 grammar if it contains only 
type 2 productions. It is also called a context-free grammar (as A can be 
replaced by a in any context). A language generated by a context-free grammar 
is called a type 2 language or a context-free language. 

Definition 4.12 A production of the form A a or A —> aB . where 
A, B € K v and a e Z. is called a type 3 production. 

Definition 4.13 A grammar is called a type 3 or regular grammar if all its 
productions are type 3 productions. A production S —> A is allowed in type 3 
grammar, but in this case S does not appear on the right-hand side of any 
production. 


EXAMPLE 4.17 


Find the highest type number wTich can be applied to the following 
productions: 

(a) S —> Aa, A —> c\Ba. B —> abc 

(b) S —» ASB | cl, A —> ciA 
(c ) S —> aS | ab 
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Solution 

(a) S —> Aa . A —» B —» a&c are type 2 and A —> c is type 3. So the 

highest type number is 2. 

(b) S —» ASZ? is type 2. 5 —> cL A -> aA are type 3. Therefore, the highest 
type number is 2, 

(c) S —> uS is type 3 and 5 —> is type 2. Hence the highest type 

number is 2. 

4.3 LANGUAGES AND THEIR RELATION 

In this section we discuss the relation between the classes of languages that we 
have defined under the Chomsky classification. 

Let^ 0 . ^ csi ,/ cfi and £ vl denote the family of type 0 languages, context- 
sensitive languages, context-free languages and regular languages, respectively. 

Property 1 From the definition, it follows that £ x] c / c n- d csX c £& 
^cfl £ ^o* 

Property 2 £ cfl c x cs] . The inclusion relation is not immediate as we allow 
A —> A in context-free grammars even when A * S, but not in context-sensitive 
grammars (we allow only 5 —> A in context-sensitive grammars). In Chapter 6 
we prove that a context-free grammar G with productions of the form A —» A 
is equivalent to a context-free grammar G { which has no productions of the 
form A —> A (except 5 -4 A). Also, when G x has S -> A, S does not appear 
on the nght-hand side of any production. So G { is context-sensitive. This 
proves d c f\ c X cs] . 

Property 3 c -i cn c x cs! c x 0 . This follows from properties 1 and 2. 
Property 4 jf rl c* d ci] c* / csl c* i 0 . 

In Chapter 5. we shall prove that Ji x \ cz^ i; cfl . In Chapter 6. we shall 
prove that d cfl c* x csl . In Section 9.7, we shall establish that =£ csl o* =£ 0 . 

Remarks 1. The grammars given in Examples 4.1-4.4 and 4.6-4.9 are 
context- free but not regular. The grammar given in Example 4.5 is regular. The 
grammars given in Examples 4.10 and 4.11 are not context-sensitive as we have 
productions of the form 2A X Aj2, CB —» BC which are not type 1 rules. But 
they are equivalent to a context-sensitive grammar by Theorem 4.2, 

2. Two grammars of different types may generate the same language. For 
example, consider the regular grammar G given in Example 4.5. It generates 
{a. by. Let G' be given by S —» SS|aSj£S|<:/jZ?. Then L(G') = L(G) as the 
productions S —» aS | bS | a j b are in G as well, and S —> SS does not generate 
any more string. 

3. The type of a given grammar is easily decided by the nature of 
productions. But to decide on the type of a given subset of £*. it is more 
difficult. By Remark 2, the same set of strings may be generated by a grammar 
of higher type. To prove that a given language is not regular or context-free, 
we need powerful theorems like Pumping Lemma. 
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4.4 RECURSIVE AND RECURSIVELY ENUMERABLE SETS 

The results given in this section will be used to prove £ cs> \ Qt £o Section 9.7. 
For defining recursive sets, we need the definition of a procedure and an 
algorithm. 

A procedure for solving a problem is a finite sequence of instructions 
which can be mechanically carried out given any input. 

An algorithm is a procedure that terminates after a finite number of steps 
for any input. 

Definition 4.14 A set X is recursive if we have an algorithm to determine 
whether a given element belongs to X or not. 

Definition 4.15 A recursively enumerable set is a set X for which we have 
a procedure to determine whether a given element belongs to X or not. 

It is clear that a recursive set is recursively enumerable. 

Theorem 4.3 A context-sensitive language is recursive. 

Proof Let G = (V N , E, P, S) and w e E*. We have to construct an algorithm 
to test whether w e L(G) or not. If w = A, then w e L(G) iff S A is in 
P. As there are only a finite number of productions in P, we have to test 
whether S A is in P or not. 

Let | w | = n > 1. The algorithm is based on the construction of a sequence 
{Wj} of subsets of (V N u E)*. Wj is simply the set of all sentential forms of 
length less than or equal to n , derivable in at most i steps. The construction 
is done recursively as follows: 

(i) W 0 = {5}. 

(ii) W /+1 = Wj u {/? e (V N u E)*| there exists a in Wj such that a fi 
and | /? | < n}. 

Wfs satisfy the following: 

(iii) Wi c W M for all i > 0. 

(iv) There exists k such that W k = W k+ 1 . 

(v) If k is the smallest integer such that W k = W k+h then W k = 
{a e (V N u E}*|5 => a and |a| < n). 

The point (iii) follows from the point (ii). To prove the point (iv), w^e 
consider the number N of strings over V N u E of length less than or equal to 
n. If | V v u E | = m, then N = 1 + m + m~ + ...+ m 11 since m l is the number 
of strings of length i over V N u E. i.e. N = (m n+l - 1 )/{m - 1), and N is fixed 

as it depends only on n and m. As any string in is of length at most n , 

| Wj | < N. Therefore, W k = W M for some k < N. This proves the point (iv). 
From point (ii) it follows that W k = W k+X implies W k+ \ = W k+2 - 

{a e (V N u E)* ] S => a. | a\ < n] - W { u W 2 u . . . u W k u W k+l ... 

= u W 2 u . . . u W k 

= W k from point (iii) 


This proves the point (v). 
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From the point (v) it follows that w e L(G ) (i.e. S => w) if and only if 
w e W t Also, W b W 2 , .. .. W k can be constructed in a finite number of steps. 
We give the required algorithm as follows: 

Algorithm to test whether w e L(G). 1. Construct W b W 2f . . . using the 

points (i) and (ii). We terminate the construction when W k+i - W k for the first 
time. 

2. If w e W h then w e L(G). Otherwise, w € L(G). (As \W k \<N, testing 
whether w is in W k requires at most N steps.) I 


EXAMPLE 4.18 


Consider the grammar G given by S —> 0 SA{2, S —> 012, 2A! Aj2, 
lAj —> 11. Test whether (a) 00112 e L(G) and (b) 001122 e L(G). 

Solution 

(a) To test whether w = 00112 € L(G ), we construct the sets W 0> W b W 2 
etc. | w [ = 5. 

W 0 = {5} 

W x = {012, 5, 0 SA } 2) 

W 2 = {012, S, 0SA]2} 

As W 2 = W b we terminate. (Although 05A]2 => 0012A|2, we cannot 
include 0012AJ2 in W\ as its length is > 5.) Then 00112 £ W b Hence, 
00112 g L(G). 

(b) To test whether w = 001122 € L(G). Here, \ w\ = 6. We construct W 0 , 
W b W 2> etc. 

W 0 = {5} 

W x = {012, 5, 0SA X 2} 

W 2 = {012, 5, 0SA X 2, 0012Ai2} 

W 3 = {012, 5, 05A t 2, 0012A]2, 001A 1 22} 

W 4 = {012, 5, 0SA X 2 9 0012Aj2, 00^22, 001122} 

W s = {012, 5, 05Aj2, 0012AA 00^22, 001122} 

As W 5 = W 4 , we terminate. Then 001122 e W A . Thus. 001122 e L(G). 

The following theorem is of theoretical interest, and shows that there 
exists a recursive set over {0, 1} which is not a context-sensitive language. The 
proof is by the diagonalization method which is used quite often in set theory. 

Ineorem 4.4 There exists a recursive set which is not a context-sensitive 
language over {0, 1}. 

Proof Let S = {0, 1}. We write the elements of Z* as a sequence (i.e. the 
elements of E* are enumerated as the first element, second element, etc.) For 
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example, one such way of writing is A, 0, L 00, 01, 10, 11, 000, .... In this 
case, 010 will be the 10 th element. 

As every grammar is defined in terms of a finite alphabet set and a finite 
set of productions, we can also write all context-sensitive grammars over X as 
a sequence, say Gj, G 2 , ... . 

We define X - {w f - e X* | w; <£ L(G/)}. We can show that X is recursive. 
If w e X*, then we can find i such that w = w t . This can be done in a finite 
number of steps (depending on | w |). For example, if w = 0100, then w = w 2 q . 
As G 2 q is context-sensitive, we have an algorithm to test whether w = w 20 e 
L(G 2 q) by Theorem 4.3. So X is recursive. 

We prove by contradiction that X is not a context-sensitive language. If it 
is so, then X = L(G n ) for some n. Consider w u (the nth element in X*). By 
definition of X y w n e X implies w n g L(G„). This contradicts X - L(G n ). 
w u <£ X implies w n e L(G n ) and once again, this contradicts X = L(G n ). Thus, 
X ^ L(G n ) for any n, i.e. X is not a context-sensitive language. I 


4.5 OPERATIONS ON LANGUAGES 

We consider the effect of applying set operations on =£ csl , £ c f \, £ r \ . Let 
A and B be any two sets of strings. The concatenation AB of A and B is 
defined by AB = {uv \ u e A, v e B). (Here, uv is the concatenation of the 
strings u and v.) 

We define A 1 as A and A ,n] as A n A for all n > 1. 

The transpose set A r of A is defined by 

A t - {u T |u e A} 

Theorem 4.5 Each of the classes £ cfu £ rX is closed under union. 

Proof Let Lj and L 2 be two languages of the same type i. We can apply 
Theorem 4.1 to get grammars 

G { = (Vm Xj, Pj, S x ) and G 2 = (V% X 2 , P 2 , A) 

of type i generating L x and L 2 , respectively. So any production in G x or G 2 
is either a —> $ where a, [5 contain only variables or A —» a } where A e 
a e X. 

We can further assume that L' v n V'n = 0 • (This is achieved by renaming 
the variables of V" N if they occur in V' N .) 

Define a new grammar G u as follows: 

G„ = (V'v U {5}, X] u X 2 , P K , 5) 
where S is a new symbol, i.e. S £ V' N u V'^ 

P u — P i KJ P2 ^ { S —> S\ . S — > S 2 } 
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We prove L(G U ) = L l u L 2 as follows: If w e Lj u Lo, then Si => w or 
^ o 

So => w. Therefore. 1 

a 


w or 


w, i.e. w € L(G W ) 


Thus, u L 2 e L(G U ). 

To prove that L(G U ) c Li u L 2 , consider a derivation of w. The first step 
should be S Si or S => So. If S => Sj is the first step, in the subsequent steps 

Si is changed. As V' N n V'^; ^ 0, these steps should involve only the variables 

* 

of W v and the productions we apply are in P v So S => w. Similarly, if the 

Ci 

first step is S => So, then S => So => w. Thus, L(G U ) = L { u Lo. Also, L(GJ 

G : ~ G, 

is of type 0 or type 2 according as L } and Lo are of type 0 or type 2. If A 
is not in L\ u L 2 , then L(GJ is of type 3 or type 1 according as L { and L 2 
are of type 3 or type 1. 

Suppose A 6 Lj. In this case, define 

G u = (V' N u V" N u {S, S'}, li u S 3 S') 

where (i) S' is a new symbol, i.e. S' £ V' N u V” N u {S}, and (ii) P u = 
P* u Po u {S' —» S, S —¥ Si, S —» S 2 }. So, L(GJ is of type i or type 3 
according as L x and L 2 are of type 1 or type 3. When A e L 2 , the proof is 
similar. 1 


Theorem 4.6 Each of the classes L 0 , £ csU £ cfh £ ri is closed under 
concatenation. 


Proof Let L x and L 2 be two languages of type L Then, as in Theorem 4.5, we 
get G\ = (V'n, Si. P\, Si) and G 2 = ( V" N , S 2 , P 2 , S 2 ) of the same type z. We 
have to prove that LiL 2 is of type z. 

Construct a new grammar G C0I1 as follows: 

Gcon - (V' N U n U {SK Si u In, Peon, S) 


where S £ V' N u T'v • 


P con — P] tj Po kj (S — ^ SjSo} 
We prove L X L 2 = L(G C0U ). If w = w t w 2 e LjLo, then 

S] => Hq, So => Wo 

G, G : 

So. 


Therefore, 


S => Si S 2 => w i Wo 

Cron Cmn 


L,L 2 c L(G con ) 
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If w e L(G con ), then the first step in the derivation of w is 5 => S X S 2 - 

As V' N n V" N = 0 and the productions in Gj or G 2 involve only the variables 

* 

(except those of the form A —> a), we have w = W\W% where S w j and 

G] 

5 ^ vr 2 . Thus LjL 2 = L(G con ). Also, G con is of type 0 or type 2 according 

as G { and G 2 are of type 0 or type 2. The above construction is sufficient when 
G\ and G 2 are also of type 3 or type 1 provided A g L { u L 2 . 

Suppose Gj and G 2 are of type 1 or type 3 and A e L l or A e L 2 . Let 
L'i = L x - {A}, L' 2 = L 2 - {A}. Then 


LjL 2 


L[L' 2 u L 2 if A is in L } but not in L 2 

< L[L 2 u Lj if A is in L 2 but not in L l 

L[L 2 u L{ u Z4 u {A} if A is in L } and also in 


As we have already shown that =£ csl and £ tX are closed under union, L { L 2 is 
of type 1 or type 3 according as L x and L 2 are of type 1 or type 3. I 

Theorem 4.7 Each of the classes jf csl , =L cfi , £ xX is closed under the 
transpose operation. 

Proof Let L be a language of type i. Then L = L(G), where G is of type i. 

We construct a new grammar G T as follows: G T = (V N , E, S), where the 
productions of P 7 are constructed by reversing the symbols on L.H.S. and 
R.H.S. of every production in P. Symbolically, a r /3 7 is in P r if a -» /3 is 
in P, 

From the construction it is obvious that G r is of type 0, 1 or 2 according 
as G is of type 0, 1 or 2 and L(G r ) = L r . For regular grammar, the proof is given 
in Chapter 5. 

It is more difficult to establish the closure property under intersection at 
present as we need the properties of families of languages under consideration. 
We state the results without proof. We prove some of them in Chapter 8. 

Theorem 4.8 (i) Each of the families £q X , £ cs \, £ x \ is closed under 

intersection. 

(ii) £ c f\ is not closed under intersection. But the intersection of a context- 
free language and a regular language is context-free. 


4.6 LANGUAGES AND AUTOMATA 

In Chapters 7 and 9, we shall construct accepting devices for the four types 
of languages. Figure 4.1 describes the relation between the four types of 
languages and automata: TM, LBA, pda, and FA stand for Turing machine, 
linear bounded automaton, pushdown automaton and finite automaton, 
respectively. 
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Languages 


Automata 



Fig. 4.1 Languages and the corresponding automata. 


4.7 SUPPLEMENTARY EXAMPLES 


EXAMPLE 4.19 


Construct a context-free grammar generating 

(a) L x - {a n b 2n | n > 1} 

(b) Li = {a w Z? /z | ra > n, m, n > 1 } 

(c) L 3 = {tf m ZP | m < n, m, n, > 1 } 

(d) L 4 = {ci n b n | m, n > 0 . m & n) 

Solution 

(a) Let Gj = ({5}, {a. b }. P, 5) where P consists of S aSbb, S abb . 

(b) Let G 2 = ({5, A}, {a, Z?}, P, S) where P consists of 5 ^ aS | aA, 
A —> aAb y A —» ab. It is easy to see that L(G) c L 2 . We prove the 
difficult part. Let a m b n e L 2 . Then, m > n > 1. As m > n, we have 
t ii - n > L So the derivation of d n b tl - a m ~ n a n b n is 

5 =b> => a m ~ H aA =?> a^a^Ab 11 ” 1 <P ? ZP 

(c) Let G 3 = ({ S ' P}. {a, Z?}. P, 5) where P consists of S SZ? | Bb, 
B uPZ? ; P —> aZ?. This construction is similar to construction in (b). 
S SZ? | Bb are used to generate b n ~ m . The remaining productions 
will generate a m b m . Hence L(G 3 ) = L 3 . 

(d) Note that L x - L 2 u u L' u L" where L' — {b n \n > 1} and 
L" = {a n | n > 1}. It is easy to see how to construct grammars 
generating L 2 . T 3 and L' and L "' Define G 4 by combining these 
constructions. Let 

G 4 = ({5, 5j, S 2 . S 3 , S 4 , A}, {a, Z>}, P 4 , S) where P 4 consists of 
S —» S| | S 2 | S 3 | S 4 , Si —» r/Si | Aa | aAZ? | ab, 

S 2 —^ S 2 Z? j AZl S 3 —^ Z?S^ J Z?, and S 4 —^ #S 4 j u. 

It is easy to see that L(G 4 ) = L 4 . 
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EXAMPLE 4.20 


Construct a grammar accepting 

L = { w e { a , 6}* | the number of a s in w is divisible by 3}. 

Solution 

Consider a string over {a, Z?} having three ms. These three ms appear amidst 
the strings of b. A typical string is b m ab n ab s ab\ For generating b”\ we can have 
the productions S —» bS . For getting the first a, we can have S aA. For 
getting Zf afterwards, we have A —> M. For getting the second a, we can have 
A — > a.F. For getting b\ we have F — > Z?F. For getting the third a and repetition 
of this pattern, we can have B —» a | aS. 

Now- we construct G as follows: 

G = ({5, A, F}, { a , Z>}, F. 5") where F consists of 

5 Z?S, S -*-» r/A, A —» M, A —» u?F, B bB, B m B aS . 

A string in L is of the form yjys . .. y„ where each y f is of the form 
b m ab n ab s a for some nonnegative integers m, n and s. 

S Zf"S =» b m aA £> Zf WA =* b m ab n aB £> b m ab n ab s S 

Hence G is the required grammar. 


EXAMPLE 4.21 


Construct a grammar G such that 

L(G) = { w e {cl b} | w has an equal number of as and Zfs}. 

Solution 

Define G = ({S, A. F}, {a, b}. P, S ) where F consists of 

5 -> ciB | Z?A, A -> aS | Z>AA | m F -> Z>S | r/FF | b 

To prove that G accepts the given language, we prove the following by 
induction on |w|. 

(i) S => vr if and only if w consists of an equal number of and V s. 

(ii) A =!> w if and only if w consists of one more a than the number of Zfs. 

(iii) F z=> w if and only if w consists of one more b than the number of ms. 

The ponits (i), (ii). (iii) are true when \w \ = 1. For A => a and a is the 
only string of length one which can be derived from F. Also, no string of 
length one is derivable from 5. Thus, there is basis for induction. 

Assume points (i), (ii), (iii) to be true for all strings of length k - 1. Let 
| w | = k. 

We prove the ‘only if part of (i). Let S ==> >v. Then the first production 
has to be either S —> aB or S bA. If the first production is S —» aF, then 
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S => =E> w\ Hence w = mvj, B z-> >iq and Jm-j | = k - 1. By induction 

hypothesis, the point (iii) is true for Wj. This means that w { has one more b 
than the number of Vs. Hence w = aw x has an equal number of as and Vs. 
(The proof is' similar if the first production is 5 bA.) 

To prove the 'if part’, assume that w has an equal number of a 's and Vs. 
If w starts with a, then w = aw x , where \w { | = k - 1. (If w starts with b the 
proof is similar.) Also has one more b than the number of a' s. By induction 
hypothesis, the point (iii) is true for nq. Then 3 => wq. As 5 —> is a 
production, we have 5 => aB. So 5 => aB ==> aw\ = w, i.e. S =e> w, which 
proves the 'if part’ of point (i). 

Similarly we can prove the points (ii) and (iii) for a string w of length k. 
By the principle of induction the points (i), (ii), (iii) are true for all strings w. 

In particular, from point (i), we can conclude that 

L(G) = {w e { a , b} | w has an equal number of a’s and Vs} 


EXAMPLE 4.22 


Construct a grammar G accepting the set L of all strings over {a, b} having 
more Vs than Vs. 

Solution 

For generating strings with a's but with no Vs, we can have the production 
S —» a, S -» aS, S Sa. (If xci e L, then ax e L. Hence we have S aS and 
S —> Sa.) If x, y e L , then xy e L (also yx s L). So xy or yx has at least two 
more Vs than Vs. So we can add a b. This can be done by having the 
productions S —> bSS, S — > SbS, S SSb. (i.e. b can be added at the beginning, 
or at the end of SS, or between S and S). With this motivation, we can construct 
G as follows: 

G = ({5}, {a, b }, P. S) w'here P consists of 

S a | aS | Sa | bSS | SbS | SSb 

We can easily prove that G accepts all strings over { a , b\ having more Vs 
than Vs. 


EXAMPLE 4.23 


Construct a grammar G accepting all strings over {«, b} containing an unequal 
number of Vs and Vs. 

Solution 

As m Example 4.20, we can construct a grammar accepting all strings having 
more Vs than a's. The required language is the union of the language of 
Example 4.20 and a similar one having more Vs than a's. So we construct 
G as follows: 
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G = ({S, Sj, S 2 }> {a, b }, P, S) where P consists of 

S -> S t | s 2 

S\ a | aS- { | S x a | bS x S x | S x bS x | SiS x b 
S 2 —| bS 2 | 5 2 i> | aS 2 S 2 | S 2 aS 2 \ SS 2 cl 

G generates all strings over {a, b} having an unequal number of s and /As. 


EXAMPLE 4.24 


If Li and L 2 are the subsets of {a, b}*, prove or disprove: 

(a) If Z.| c L 2 and L x is not regular, then L 2 is not regular. 

(b) If Lj c L 2 and L 2 is not regular, then L x is not regular. 

Solution 

(a) Let Lj = {a n b n \ n > 1}. L { is not regular. Let L 2 = {a, b}*. By 
Example 4.5, L 2 is regular. Hence (a) is not true. 

(b) Let L 2 = {a"b n \ n > 1}. It is not regular. But any finite subset is 

regular. Taking L, to be a finite subset of L 2 , we disprove (b). 


EXAMPLE 4.25 


Show that the set of all non-palindromes over { a , b] is a context-free 
language. 

Solution 

Let w e {a, b}* be a non-palindrome. Then w may have the same symbol in 
the first and last places, same in the second place from the left and from the 
right, etc.; this pattern will not be there after a particular stage. The 
productions S — > aSa | bSb \ A may be used for fulfilling the palindrome- 
condition for the first and last few places. For violating the palindrome 
condition, the productions of the form A —» ciBb\bBa and B —> aB \bB\A 
will be useful. So the required grammar is G = ({5, A, B }, { a, b }, F, 5) 
where P consists of 

S —> aSci | bSb | A 
A ciBb | bBa 
B — > aB | bB | A 

SELF-TEST 

to Questions 1-12: 

productions S —> 55, 5 — > aSb, S —> bSa , S —> A, 
(b) S =?> abba 
(d) 5 => aaa 


Choose the correct answers 

1. For a grammar G with 
(a) S => abba 
(c) abba <£ L(G) 
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2. If a p in a grammar G, then 

(a) a => ft (b ) ft => a 

(c) p => (X (d) none of these 

3. If a —» p is a production in a grammar G, then 

(a) aa pp (b) aap => ppa 

(c) aa =» /3a (d) aaa r^> ppp 

4. If a grammar G has three productions S —> aSa | bsb j c, then 

(a) abcba and bacab e L(G) (b) abcba and abcab e L(G) 

(c) rzcccfl and bcccb e L(G) (d) acccfc and iccca € L(G) 

5. The minimum number of productions for a grammar G = ({5}, {0, 1, 
2, .... 9}, P, 5) for generating {0, 1. 2, ..9} is 

(a) 9 (b) 10 

(c) 1 (d) 2 

6 . If G l = (N, Z P h S) and G 2 = (N, Z, P 2 , 5) and P ] c P 2? then 

(a) L(G0 e L(G 2 ) (b) L(G 2 ) c L{G X ) 

(c) L(G{) = L(G 2 ) (d) none of these. 

7. The regular grammar generating {a 11 : n > 1} is 

(a) ({5}, {a}, {5 -> tfS}, 5) 

(b) ({5}, {a}, {5 55, 5 a}) 

(c) ({S}, {a}, {5 -> aS}, 5) 

(d) ({5}, {#}, {5 —> S —> a. 5) 

8 . L ~ {theory, of, computer, science} can be generated by 

(a) a regular grammar 

(b) a context-free grammar but not a regular grammar 

(c) a context-sensitive grammar but not a context-free grammar 

(d) only by a type 0 grammar. 

9. {d l | n > 1} is 

(a) regular 

(b) context-free but not regular 

(c) context-sensitive but not context-free 

(d) none of these. 

10, {a n b n \n > 1 } is 

(a) regular 

(b) context-free but not regular 

(c) context-sensitive but not context-free 

(d) none of these. 

M . {a n b 1l c n | /? > 1 } is 

(a) regular 

(b) context-free but not regular 

(c) context-sensitive but not context-free 

(d) none of these. 
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12. {a u b n c m | ft, m > 1 } is 

(a) regular 

(b) context-free but not regular 

(c) context-sensitive but not context-free 

(d) none of these. 

State whether the following Statements 13-20 are true or false: 

13. In a grammar G - (V N) X, P, S ), and X are finite but P can be 

infinite. 

14. Two grammars of different types can generate the same language. 

15. If G = (V N , X, P, S ) and P * 0, then L(G) * 0. 

16. If a grammar G has three productions, i.e. 5 -4 AA, A —> A -4 

to, then L(G) is finite. 

17. If Li = {a il b m \m, n > 1} and L 2 = { b m cP | m, /?>!}, then L x n L 2 = 

> 1 }. 

18. If a grammar G has productions 5 —> aS\bS\a y then L(G) - the set of 
all strings over {a, b} ending in cl 

19. The language {a n bc H \n > 1} is regular. 

20. If the productions of G are S —> ciS\Sb\a\b, then abab e L(G). 


EXERCISES 

4.1 Find the language generated by the following grammars: 

(a) S -4 051|0A1, A —> 1A| 1 

(b) S -4 05110A101 | IP | 1, A -4 0A10, B -> IB 11 

(c) S -4 05BA 101 A, AB -4 BA , IB -> 11, 1A -4 10, 0A -4 00 

(d) S -4 051 |0A1, A 1A0 j 10 

(e) 5 -4 0A 115| 0| 1, A -4 1A 11S| 1 

4.2 Construct the grammar, accepting each of the following sets: 

(a) The set of all strings over {0, 1} consisting of an equal number of 
0 ’s and Ls. 

(b) {0 n l m 0 m r | m, n > 1 } 

(c) {0 u l 2n \ n > 1 } 

(d) {0 n l n \ n > 1 } u {IV | m > 1 } 

(e) {0 n l m 0 ,l \m, n > 1 } u {0 n l m 2 m \ m, n > 1 }. 

4.3 Test whether 001100, 001010, 01010 are in the language generated by 
the grammar given in Exercise 4.1(b). 

4.4 Let G = ({A, B , 5}, {0, 1}, P, S\ where P consists of 5 —> 0AB, 
A 0 -4 50B, Ai -4 5B1, B -4 SA, B -4 01. Show that L(G) = 0. 
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4.5 Find the language generated by the grammar S AB , A —> Al j 0, 

B -4 2B | 3. Can the above language be generated by a grammar of 
higher type? 

4.6 State whether the following statements are true or false. Justify your 
answer with a proof or a counter-example. 

(a) If G] and G 2 are equivalent, then they are of the same type. 

(b) If L is a finite subset of X*, then L is a context-free language. 

(c) If L is a finite subset of X*, then L is a regular language. 

4.7 Show that {a il ~ | n > 1} is generated by the grammar S a, S —» A 3 A 4 , 

A 3 —* AiAiA'y, A 3 —^ A [A 2 , A]A 2 —^ (iA 2 A\ y A\ci —^ ciAy A 2 a. —> aA 2 , 

AjA 4 —^ A 4 cl, A 2 A 4 —^ A§a, A 2 A 5 —^ A^ci, A 5 —^ cl. 

4.8 Construct (i) a context-sensitive but not context-free grammar, (ii) a 
context-free but not regular grammar, and (iii) a regular grammar to 
generate {a u \n > 1}. 

4.9 Construct a grammar which generates all even integers up to 998. 

4.10 Constmct context-free grammars to generate the following: 

(a) {0 m l n \m ^ 72, m, n > 1 }. 

(b) {a l b m c ll \o ne of l m, n equals 1 and the remaining two are equal}. 

(c) { 0 "T ;1 11 < m < n }. 

(d) {a l b m c u \l 4- m = n). 

(e) The set of all strings over {0, 1} containing twice as many 0's as 

Vs. 

4.11. Construct regular grammars to generate the following: 

(a) { a 2n j n > 1 }. 

(b) The set of all strings over [a, h } ending in a. 

(c) The set of all strings over {a. b } beginning with a. 

(d) {ab m c n \l, m, n> 1}. 

(e) {( ab•)" | n > 1 }. 

4.12. Is ==> an equivalence relation on (V\? u X)*? 

G 

4.13. Show that G\ ~ ({5}, {a, b}, P b S ), where P j = {5 —» aSb\ab } is 

equivalent to G 2 = ({5, A, B , C}. {a, b} y P 2 , 5). Here P 2 consists of 
S —^ AC, C —^ SB. S —^ AB. A —> ci , B —> b. 

4.14. If each production in a grammar G has some variable on its right-hand 
side, w 7 hat can you say about L(G)? 

4.15. Show’ that { abc , bca. cab} can be generated by a regular grammar 
whose terminal set is {a, b, c}. 

4.16. Construct a grammar to generate {(abf\n > 1} u {( ba) u \n >1}. 

4.17. Show 7 that a grammar consisting of productions of the form A -4 xB j y. 
where x, y are in X* and A, B e V N , is equivalent to a regular grammar. 




Regular Sets and 
Regular Grammars 


In this chapter, we first define regular expressions as a means of representing 
certain subsets of strings over £ and prove that regular sets are precisely those 
accepted by finite automata or transition systems. We use pumping lemma for 
regular sets to prove that certain sets are not regular. We then discuss closure 
properties of regular sets. Finally, we give the relation between regular sets 
and regular grammars. 

5.1 REGULAR EXPRESSIONS 

The regular expressions are useful for representing certain sets of strings in an 
algebraic fashion. Actually these describe the languages accepted by finite state 
automata. 

We give a formal recursive definition of regular expressions over £ as 
follows: 

1. Any terminal symbol (i.e. an element of £), A and 0 are regular 
expressions. When we view a in £ as a regular expression, we denote 
it by a. 

2. The union of two regular expressions Rj and R 2 . written as + R 2 , 
is also a regular expression. 

3. The concatenation of two regular expressions Rj and R->, written as 
RiR 2 , is also a regular expression. 

4. The iteration (or closure) of a regular expression R, written as R*, is 
also a regular expression. 

5. If R is a regular expression, then (R) is also a regular expression. 

6. The regular expressions over £ are precisely those obtained 
recursively by the application of the rules 1-5 once or several times. 
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Notes: (1) We use x for a regular expression just to distinguish it from the 
symbol (or string) x. 

(2) The parentheses used in Rule 5 influence the order of evaluation of 
a regular expression. 

(3) In the absence of parentheses, we have the hierarchy of operations as 
follows: iteration (closure), concatenation, and union. That is, in evaluating a 
regular expression involving various operations, we perform iteration first, then 
concatenation, and finally union. This hierarchy is similar to that followed for 
arithmetic expressions (exponentiation, multiplication and addition). 

Definition 5.1 Any set represented by a regular expression is called a regular 
set . 

If, for example, a, b e Z, then (i) a denotes the set {a} } (ii) a + b denotes 
{a y b }, (iii) ab denotes {ab}, (iv) a* denotes the set {A. a, aa, aaa, . . .} and 
(v) (a -I- b)* denotes { a , b}*. 

The set represented by R is denoted by L(R). 

Now we shall explain the evaluation procedure for the three basic 
operations. Let R| and R 2 denote any two regular expressions. Then (i) a 
string in L(Rj + R 2 ) is a string from R} or a string from R 2 ; (ii) a string in 
L(R 1 R 2 ) is a string from R { followed by a string from R 2 , and (iii) a string 
in L(R*) is a string obtained by concatenating n elements for some n > 0. 
Consequently, (i) the set represented by Rj + R 2 is the union of the sets 
represented by R] and R 2 . (ii) the set represented by R t R 2 is the concatenation 
of the sets represented by R { and R 2 . (Recall that the concatenation AB of sets 
A and B of strings over Z is given by AB - {nq-vvo | vtq e A , w 2 e B }. and 
(iii) the set represented by R* is . . . vr„|w; is in the set represented by 

R and n > 0.} Hence. 

L(R ] + R : )= L( Rj) u L(R 2 ), L(R!R 2 ) = L(R,)L(R 2 ) 

L(R*) = (L( R))* 

Also. 

L(R*) = (L( R))* = U 
;?=0 

L(0) = 0, L( a) = {a}. 

Note: By the definition of regular expressions, the class of regular sets over 
Z is closed under union, concatenation and closure (iteration) by the conditions 
2, 3, 4 of the definition. 


EXAMPLE 5.1 


Describe the following sets by regular expressions: (a) {101}. (b) {abba}, 
(c {01. 10}, (d) {A. ab}, (e) {abb, a, b, bba}, (f) {A, 0, 00, 000. . . and 

(g) {1, 11. 111. ...}. 

Solution 

(a) Now. {1}, {0} are represented by 1 and 0. respectively. 101 is obtained 
by concatenating 1. 0 and 1. So. {101} is represented by 101. 
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(b) abba represents {abba}. 

(c) As {01, 10} is the union of {01} and {10}, we have {01, 10} 
represented by 01 + 10, 

(d) The set {A, ab} is represented by A + ab. 

(e) The set {abb, a, b, bba } is represented by abb + a + b + bba. 

(f) As {A, 0, 00, 000, . . .} is simply {0}*. it is represented by 0*. 

(g) Any element in {1, 11, 111, ...} can be obtained by concatenating 
1 and any element of {1}*. Hence 1(1)* represents {1, 11, 111, . . .}. 


EXAMPLE 5.2 


Describe the following sets by regular expressions: 

(a) L, = the set of all strings of 0’s and Ts ending in 00. 

(b) L, = the set of all strings of 0's and l's beginning with 0 and ending 

with 1. 

(c) l 3 = {a. il mi. linn. ...}. 


Solution 

(a) Any string in L x is obtained by concatenating any string over {0, 1} 
and the string 00. {0, 1} is represented by 0 + 1. Hence L x is 
represented by (0 + 1)* 00. 

(b) As any element of L, is obtained by concatenating 0, any string over 
{0, 1} and 1. L 2 can be represented by 0(0 + 1)* 1. 

(c) Any element of L-< is either A or a string of even number of l‘s, i.e. 
a string of the form (11)”, n > 0. So L ? can be represented by (11)*. 

5.1.1 Identities for Regular Expressions 

Two regular expressions P and Q are equivalent (we write P = Q) if P and 
Q represent the same set of strings. 

We now give the identities for regular expressions; these are useful for 
simplifying regular expressions. 

/, 0 + R = R 

/, 0R = R0 = 0 
/, AR = RA = R 
/ 4 A* = A and 0* = A 

I 5 R + R = R 

/ 6 R*R* = R* 

/ 7 RR* = R*R 

h (R*)* = R* 

Z 9 A + RR* = R* = A + R*R 

i io (PQ)*P = P(QP>* 
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In (P + Q)* = (P*Q*)* = (P* + Q*)* 

1 12 (P + Q)R = PR + QR and R(P + Q) = RP + RQ 

Note: By the ‘set P‘ we mean the set represented by the regular expression P. 

The following theorem is very much useful in simplifying regular 
expressions (i.e. replacing a given regular expression P by a simpler regular 
expression equivalent to P). 

Theorem 5.1 (Arden's theorem) Let P and Q be two regular expressions 
over I. If P does not contain A, then the following equation in R, namely 

R = Q + RP (5.1) 

has a unique solution (i.e. one and only one solution) given by R = QP*. 

Proof Q + (QP*)P = Q(A + P*P) = QP* by / 9 

Hence (5.1) is satisfied when R = QP*. This means R = QP* is a solution 
of (5.1). 

To prove uniqueness, consider (5.1). Here, replacing R by Q + RP on the 
R.H.S.. we get the equation 

Q + RP = Q + (Q + RP)P 
= Q + QP + RPP 
= Q + QP + RP 2 

= Q + QP + QP 2 + + QP' + RP +1 

= Q(A + P + P 2 + • • • + P') + RP +I 

From (5.1). 

R = Q(A + P + P 2 + • • • + PO + RF Vl for i > 0 (5.2) 

We now show that any solution of (5.1) is equivalent to QP*. Suppose R 
satisfies (5.1). then it satisfies (5.2). Let w be a string of length i in the set 
R. Then vv belongs to the set Q( A + P + P : 4- . . . + F) + RP M . As P does 
not contain A, RF +1 has no string of length less than / + 1 and so w is not 
in the set RF ] . This means that w belongs to the set Q( A + P + P 2 + . . . 
+ P £ ), and hence to QP*. 

Consider a string w in the set QP*. Then w is in the set QP ; for some 
k > 0. and hence in £?(A + P + P“ + • • • + P A ). So w is on the R.H.S. of 
(5.2). Therefore, w is in R (L.H.S. of (5.2)). Thus R and QP* represent the 
same set. This proves the uniqueness of the solution of (5.1). I 

Note: Henceforth in this text, the regular expressions will be abbreviated r.e. 


Example 5.3 


(a) Give an r.e. for representing the set L of strings in which every 0 is 
immediately followed by at least two Fs. 

(b) Prove that the regular expression R = A + 1*(011)*(1* (Oil)*)* also 
describes the same set of strings. 
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Solution 

(a) If w is in L, then either (a) w does not contain any 0, or (b) it contains 
a 0 preceded by 1 and followed by 11. So w can be written as 
W\W 2 . . . w Il9 where each w { is either 1 or Oil. So L is represented 
by the r.e. (1 4 Oil)*. 

(b) R = A + P^i*, where = 1*(011)* 

= P* using l 9 

= ( 1 *( 011 )*)* 

= (P*?* )* letting P 2 = 1, P 3 = Oil 

= (P 2 + P ? )* using I u 

= (1 4 Oil)* 


EXAMPLE 5.4 


Prove (1 4- 00*1) + (1 + 00*1)(0 + 10*1)* (0 + 10*1) = 0*1(0 + 10*1)*. 

Solution 

L.H.S. = (1 + 00*1) (A + (0 + 10*1)* (0 + 10*1)A using I X2 
= (1 + 00*1) (0 + 10*1)* using 1 9 

= (A 4 00*)1 (0 + 10*1)* using I n for 1 4 00*1 

= 0*1(0 4 10*1)* using I 9 

= R.H.S. 

5.2 FINITE AUTOMATA AND REGULAR EXPRESSIONS 

In this section we study regular expressions and their representation. 


5.2.1 Transition System Containing a-moves 

The transition systems can be generalized by permitting A-transitions or 
A-moves which are associated with a null symbol A. These transitions can 
occur when no input is applied. But it is possible to convert a transition system 
with A-moves into an equivalent transition system without A-moves. We shall 
give a simple method of doing it with the help of an example. 

Suppose we want to replace a A-move from vertex v x to vertex v 2 . Then 
we proceed as follows: 

Step 1 Find all the edges starting from v 2 . 

Step 2 Duplicate all these edges starting from v h without changing the edge 
labels. 
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Step 3 If V] is an initial state, make v 2 also as initial state. 
Step 4 If v 2 is a final state, make vj also as the final state. 


EXAMPLE 5.5 


Consider a finite automaton, with A-moves, given in Fig. 5.1. Obtain an 
equivalent automaton without A-moves. 



Solution 

We first eliminate the A-move from q Q to q x to get Fig. 5.2(a). q { is made 
an initial state. Then we eliminate the A-move from q 0 to q 2 in Fig. 5.2(a) 
to get Fig. 5.2(b). As q 2 is a final state, q 0 is also made a final state. Finally, 
the A-move from to q 2 is eliminated in Fig. 5.2(c). 



2 


(c) 

Fig. 5.2 Transition system for Example 5.5, without A-moves. 
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EXAMPLE 5.6 


Consider a graph (i.e. transition system), containing a A-move, given in 
Fig. 5.3. Obtain an equivalent graph (i.e. transition system) without A-moves. 



Fig. 5.3 Finite automaton of Example 5.6. 


Solution 

There is a A-move from q {) to r/ 3 . There are two edges, one from q 3 to q 2 with 
label 0 and another from q 3 to q± with label 1. We duplicate these edges from 
q 0 . As q 0 is an initial state. q 3 is made an initial state. The resulting transition 
graph is given in Fig. 5.4. 



Fig. 5.4 Transition system for Example 5.6, without A-moves. 


5.2.2 NDFAs with a-moves and Regular expressions 

In this section, we prove that every regular expression is recognized by a 
nondeterministic finite automaton (NDFA) with A-moves. 

Theorem 5.2 (Kleene’s theorem) If R is a regular expression over £ 
representing L c £*, then there exists an NDFA M with A-moves such that 
L = 7TM). 

Proof The proof is by the principle of induction on the total number of 
characters in R. By ‘character' we mean the elements of X, A, 0, * and +. 
For example, if R = A + 10*11*0, the characters are A, +, 1, 0, *, 1, 1, *, 
0, and the number of characters is 9. 

Let L(R) denote the set represented by R 
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Basis. Let the number of characters in R be 1. Then R = A, or R = 0, or 
R = a h a { e X. The transition systems given in Fig. 5.5 will recognize these 
regular expressions. 



R - A 



R = a "^^ 


Fig. 5.5 Transition systems for recognizing elementary regular sets. 


Induction step. Assume that the theorem is true for regular expressions having 
n characters. Let R be a regular expression having n + 1 characters. Then, 

R - P + Q or R = PQ or R = P* 

according as the last operator in R is +, product or closure. Also P and Q are 
regular expressions having n characters or less. By induction hypothesis, L(P) 
and L(Q) are recognized by M\ and M 2 where Mi and M 2 are NDFAs with 
A-moves, such that L(P) = T{M\) and L( Q) = T(M 2 ). M\ and M 2 are 
represented in Fig. 5.6. 



The initial state and the final states of M { and M 2 are represented in the usual 
way. 

Case 1 R = P + Q. In this case we construct an NDFA M with A-moves 
that accepts L(P + Q) as follows: q$ is the initial state of M, q 0 not in M ] 
or M 2 . qj is the final state of M: once again qf not in Mj or My M contains 
all the states of M } and M 2 and also their transitions. We add additional 
A-transitions from q$ to the initial states of and M 2 and from the final 
states of Mi and M 2 to qp The NDFA M is as in Fig. 5.7. It is easy to see 
that T(M) = T(Mi) u T(M 2 ) = L(P + Q). 

Case 2 R = PQ In this case we introduce q 0 as the initial state of M and 
qf as the final state of M, both q (h q f not in M 3 or My New A-transitions are 
added between q 0 and the initial state of My between final states of M { and 
the initial state of My and between final states of M 2 and the final state q f - of 
M, See Fig. 5.8. 
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Fig. 5.7 NDFA accepting L(P + Q). 



Case 3 R = (P)* In this case, q 0 , q and q f are introduced. New A-transitions 
are introduced from q 0 to q . q to qp q to the initial state of M x and from the 
final states of M { to q . See Fig. 5.9. 

Thus in all the cases, there exists an NDFA M with A-moves, accepting 
the regular expression R with n + 1 characters. By the principle of induction, 
this theorem is true for all regular expressions. I 



Fig. 5.9 NDFA accepting L(P*). 








Chapter 5: Regular Sets and Regular Grammars S 145 


Theorem 5.1 gives a method of constructing NDFAs accepting P + Q. PQ 
and P* using the NDFAs corresponding to P and Q. In the later sections we 
give a method of converting NDFA M with A-moves into an NDFA M ] 
without A-moves and then into a DFA M 2 such that T(M) = T(M\) - T(M 2 )- 
Thus, if a regular expression P is given, we can construct a DFA accepting 
L( P). 

The following theorem is regarding the converse. Both the Theorems 5.2 
and 5.3 prove the equivalence of regular expressions or regular sets and the 
sets accepted by deterministic finite automata. 

Theorem 5.3 Any set L accepted by a finite automaton M is represented by 
a regular expression. 

Proof Let 

M = ({<?! . . . q m ], X. 8, < 7 ], F) 

The construction that we give can be better understood in terms of the state 
diagram of M. If a string w e X* is accepted by M, then there is a path from 
q x to some final state with path value w. So to each final state, say q p there 
corresponds a subset of X* consisting of path values of paths from q 0 to q k 
As T(M) is the union of such subsets of X*. it is enough to represent them by 
regular expressions. So the main part of the proof lies in the construction of 
subsets of path values of paths from the state q x to the state q k 

Let Pij denote the set of path values of paths from q t to qj whose 


intermediate vertices lie in 

(?i 

, . . 

q k ). We 

construct Pj] for k = 0, 

l. .... 

n recursively as follows: 






pf) - 

r [j 

{a 

€ X| 

8{q i: a) 

= «}) 

(5.3) 

pH = 


e Xj 

S(q h a) 

= q,} U {A} 

(5.4) 

p h 

pk- 

r ik 

‘(4‘ 

L ^ jyk-\ 

) r kj 


(5.5) 


In terms of the state diagram, the construction can be understood better. 
Pjj simply denotes the set of path values (i.e. labels) of edges from qi to qj. 
In p9 we include A in addition to labels of self-loops from q v This explains 
(5.3) and (5.4). 

Consider a path from to qj whose intermediate vertices lie in 
{< 7 j. . . If the path does not pass through q h then its path value lies in 

Pjj 1 . Otherwise, the path passes through q k possibly more than once. The path 
can be split into several paths with path values nq. h t 2 . . . as in Fig. 5.10. 
w = wpv 2 . . . \\'j. wj is the path value of the path from q { to q k (without 
passing through cj h i.e. q k is not an intermediate vertex). >v 2 . . . .. W/_ } are the 
path values of paths from q k to itself without passing through q k . w/ is the path 
value of the path from q k to qj without passing through q k . So w\ is in P^~ l , 
u' 2 . .. wj_ { are in (P kI ~ l ) * and wj is in P k j~ l . This explains (5.5). 
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Fig. 5.10 A path from q- to Qj. 


We prove that the sets introduced by (5.3)—(5.5) are represented by 
regular expressions by induction on k (for all i and j). Pf} is a finite subset of 
X, say {flj, .... a f }. Then, Py is represented by + a 2 + • * • 4- a v 

Similarly, we can construct p.9 representing p9 . Thus, there is basis for 
induction. 

Let us assume the result for k - 1, i.e. F-*” 1 is represented by a regular 
expression P^T 1 for all i and j. From (5.5), we have P t j = P/jT 1 (P ^ 1 ) * P^f 1 uP^ { . 
So it is obvious that Pjj is represented by = F^" 1 (P^r 1 ) ^ P^" 1 uPj^ 1 . 

Therefore, the result is true for all k. By the principle of induction, the sets 
constructed by (5.3)-(5.5) are represented by regular expressions. 

As Q = {gj. . . q m ), Pdenotes the set of path values of all paths from 

n 

q x to q:. If F = {q f , .. qt }, then T(M) = y ■ So T(M) is represented 

by the regular expression P/” + ■ • • + P/^ . Thus, L = T(M) is represented 
by a regular expression. 

Note: Pfj and p9 are the subsets of X u {A}, and so they are finite sets. So 

every P/j is obtained by applying union, concatenation and closure to the set 
of all singletons in X u {A}. Using this we prove Kleene's theorem (Theorem 
5.4) at the end of this section. Kleene's theorem characterizes the regular sets 
in terms of subsets of X and operations (union, concatenation, closure) on 
singletons in X u {A}. 


5.2.3 Conversion of Nondeterministic Systems to 
Deterministic Systems 

The construction we are going to give is similar to the construction of a DFA 
equivalent to an NDFA and involves three steps. 

Step 1 Convert the given transition system into state transition table where 
each state corresponds to a row and each input symbol corresponds to a 
column. 

Step 2 Construct the successor table which lists the subsets of states 
reachable from the set of initial states. Denote this collection of subsets by Q\ 

Step 3 The transition graph given by the successor table is the required 
deterministic system. The final states contain some final state of NDFA, If 
possible, reduce the number of states. 
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Note: The construction is similar to that given in Section 3.7 for automata 
except for the initial step. In the earlier method for automata, we started with 
[q 0 ]. Here we start with the set of all initial states. The other steps are similar. 


EXAMPLE 5.7 


Obtain the deterministic graph (system) equivalent to the transition system 
given in Fig. 5.11. 



Fig. 5.11 Nondeterministic transition system of Example 5.7. 

Solution 

We construct the transition table corresponding to the given nondeterministic 
system. It is given in Table 5.1. 


TABLE 5.1 Transition Table for Example 5.7 


State/I. 

a 

b 



Qh Q 2 

Qi 


Qo 

@ 

QO; dl 



We construct the successor table by starting with [q 0 , q{\. From Table 5.1 
we see that [q 0 . q h q 2 ] is reachable from [q 0 , q\\ by a &-path. There are no 
u-paths from [q$, q Similarly, [q 0 , q{\ is reachable from [q 0 , q h q 2 ] by an 
< 2 -path and [ q 0 , q 2 ] is reachable from itself. We proceed with the 

construction for all the elements in Q\ 

We terminate the construction when all the elements of Q appear in the 
successor table. Table 5.2 gives the successor table. From the successor table 
it is easy to construct the deterministic transition system described by Fig. 5.12 

TABLE 5.2 Deterministic Transition Table for Example 5.7 
Q a b 

too, 9i] 0 

fao. 9i. 

0 


too. t?i] 


too, du dil 
too, Qi. <72] 



148 5 Theory of Computer Science 



Fig, 5.12 Deterministic transition system for Example 5.7. 

as q 0 and q 2 are the final states of the nondeterministic system [< q 0 . q } ] and 
[q {y q v q 2 ] are the final states of the deterministic system. 


5.2,4 Algebraic Method Using Arden’s Theorem 

The following method is an extension of the Arden's theorem (Theorem 5.1). 
This is used to find the r.e. recognized by a transition system. 

The following assumptions are made regarding the transition system; 

(i) The transition graph does not have A-moves. 

(ii) It has only one initial state, say Vj. 

(iii) Its vertices are v x . .. v n . 

(iv) V/ the r.e. represents the set of strings accepted by the system even 
though v/ is a final state. 

(v) a,j denotes the r.e. representing the set of labels of edges from v, to 
v h When there is no such edge. a i; - 0. Consequently, we can get the 
following set of equations in Vj ... V„; 

V l = V } a n + V 2 a :i + ■ • • + + A 

V 2 = \ i CX12 \ 2 OC 2 2 + • * * + V /? (X ;?2 

V„ = V,a,„ + V ; a 2;i + • • • + V n a, m 

Bv repeatedly applying substitutions and Theorem 5.1 (Arden's theorem), 

we can express V/ in terms of G^-'s. 

For getting the set of strings recognized by the transition system, we have 
to take the ‘union* of all V/s corresponding to final states. 


EXAMPLE 5.8 


Consider the transition system given in Fig. 5,13. Prove that the strings 
recognized are (a + a(b + aa)*b)* a(h + aa)* a. 
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a b 



Fig. 5.13 Transition system of Example 5.8. 


Solution 

We can directly apply the above method since the graph does not contain any 
A-move and there is only one initial state. 

The three equations for q h q 2 and q 2 can be written as 

q { = q x a + q 2 b + A, q 2 = qja + q 2 b + q 3 a, q 3 = q 2 a 

It is necessary to reduce the number of unknowns by repeated substitution. By 
substituting q 3 in the g r equation, we get by applying Theorem 5.1 

q 2 = qja + q 2 b + q 2 aa 
= qja + q 2 (b + aa) 

= qjatb + aa)* 

Substituting q 2 in q h v/e get 

q { = q t a + qja(b + aa)*b + A 
= q^a + a(b + aa)*b) + A 

Hence, 

qi = A(a + a(b + aa)*b)* 
q 2 = (a + a(b + aa)*b)* a(b + aa)* 
q 3 = (a + a(b + aa)*b)* a(b + aa)*a 
Since is a final state, the set of strings recognized by the graph is given by 
(a + a(b + aa)*b)*a(b + aa)*a 


EXAMPLE 5.9 


Prove that the finite automaton whose transition diagram is as shown in 
Fig. 5,14 accepts the set of all strings over the alphabet { a t b } with an equal 
number of s and b' s, such that each prefix has at most one more a than the 
V s and at most one more b than the a’s. 


a 



Fig. 5.14 Finite automaton of Example 5.9. 
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Solution 

We can apply the above method directly since the graph does not contain the 
A-move and there is only one initial state. We get the following equations for 

qb ^ q 4 * 

qi - q 2 b + q 3 a + A 
q 2 = qi a 

q- = qib 

q 4 = q 2 a + q ? b + q 4 a + q 4 b 

As q L is the only final state and the q r equation involves only q 2 and q 3 , we 
use only q 2 - and q 3 -equations (the q 4 -equation is redundant for our purposes). 
Substituting for q 2 and q 3 , we get 

q } = q^b + qjba + A = q t (ab + ba) + A 
By applying Theorem 5.1, we get 

q { = A(ab 4- ba)* = (ab + ba)* 

As q { is the only final state, the strings accepted by the given finite automaton 
are the strings given by (ab + ba)*. As any such string is a string of ab' s, 
and ba' s, we get an equal number of as and b' s. If a prefix x of a sentence 
accepted by the finite automaton has an even number of symbols, then it 
should have an equal number of as and V s since x is a substring formed by 
ab 's and ba s. If the prefix x has an odd number of symbols, then we can write 
x as ya or yb. As y has an even number of symbols, y has an equal number 
of as and V s. Thus, x has one more a than b or vice versa. 


EXAMPLE 5.10 


Describe in English the set accepted by the finite automaton whose transition 
diagram is as shown in Fig. 5.15. 


0 1 0,1 



Fig. 5.15 Finite automaton of Example 5.10. 


Solution 

We can apply the above method directly as the transition diagram does not 
contain more than one initial state and there are no A-moves. We get the 
following equations for q l5 q 2 , q 3 . 

q ] = qiO + A 

q: = qil + q 2 l 

q? = q:0 + q 3 (0 + 1) 
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By applying Theorem 5.1 to the q r equation, we get 

qi = AO* = 0* 


So, 

q 2 = qil + q 2 l = 0*1 + q 2 l 

Therefore, 

q 2 zz (0*1)1* 

As the final states are qi and q 2 , we need not solve for q 3 : 

qi + q 2 = 0* + 0*(11*) = 0*(A + 11*) = 0*(1*) by I 9 

The strings represented by the transition graph are 0*1*. We can interpret the 
strings in the English language in the following way: The strings accepted 
by the finite automaton are precisely the strings of any number of 0’s (possibly 
A) followed by a string of any number of Vs (possibly A). 


EXAMPLE 5.11 


Construct a regular expression corresponding to the state diagram described by 
Fig. 5.16. 



Solution 

There is only one initial state. Also, there are no A-moves. The equations are 

= q } 0 + q 3 0 + A 
q 2 = qil + q 2 l + q 3 l 

qs = q^o 

So. 

q 2 = q } l + q 2 l + (q 2 0)l = q } l + q 2 (l + 01) 

By applying Theorem 5.1. we get 

q 2 = qjl(l + 01)* 
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Also, 

= q;0 + q 3 0 + A = q|0 + q 2 00 + A 
= qiO 4 (qil(l 4 01)*)00 + A 
= qi(0 + 1(1 4- 01)* 00) 4 A 
Once again applying Theorem 5.1, we get 

qi = A(0 + 1(1 4 01)* 00)* = (0 4 1(1 + 01)* 00)* 

As qi is the only final state, the regular expression corresponding to the given 
diagram is (0 4 1(1 + 01)* 00)*. 


EXAMPLE 5.12 


Find the regular expression corresponding to Fig. 5.17. 

0 1 



Fig. 5.17 Finite automaton of Example 5.12. 


Solution 

There is only one initial state, and there are no A-moves. So, we form the 
equations corresponding to q l5 q 2 , q 3 , q 4 : 

qj = q } 0 4- q 3 0 + q 4 0 4 A 

q 2 = q L l 4 q 2 l + q 4 l 
q 3 = 

^4 = q?i 

Now. 

q 4 = q 3 l = (q 2 0)l = q 2 01 

Thus, we are able to write q 3 . q 4 in terms of q 2 . Using the q 2 -equation, we 
get 

q 2 = qjl 4- q 2 l 4 q 2 011 = qil 4 q 2 (l 4 Oil) 




Chapter 5: Regular Sets and Regular Grammars B ) S3 


By applying Theorem 5.1. we obtain 

q 2 = (ql)(l + Oil)* = qj(l(l + Oil)*) 

From the q r equation, we have 

q^ = qjO + q 2 O0 + q 2 010 + A 
~ qiO + q 2 (O0 + 010) ■+■ A 
= ^0 + qld + Oil)* (00 + 010) + A 
Again, by applying Theorem 5.1. we obtain 

q! = A(0 + 1(1 + Oil)* (00 + 010))* 
q 4 = q 2 01 = qjld + Oil)* 01 

= (0 + 1(1 + 011)*(00 + O10))*(l(l + Oil)* 01) 

5.2.5 Construction of Finite Automata Equivalent 
to a Regular Expression 

The method we are going to give for constructing a finite automaton 
equivalent to a given regular expression is called the subset method which 
involves two steps. 

Step 1 Construct a transition graph (transition system) equivalent to the 
given regular expression using A-moves. This is done by using Theorem 5.2. 

Step 2 Construct the transition table for the transition graph obtained in 
step 1. Using the method given in Section 5.2.3, construct the equivalent DFA. 
We reduce the number of states if possible. 


EXAMPLE 5.13 


Construct the finite automaton equivalent to the regular expression 

(0 + 1)*(00 + 11)(0 + 1 )* 

Solution 

Step 1 (Construction of transition graph) First of all we construct the 
transition graph with A-moves using the constructions of Theorem 5.2. Then 
we eliminate A-moves as discussed in Section 5.2,1. 

We start with Fig. 5.18(a). 

We eliminate the concatenations in the given r.e. by introducing new 
vertices q x and c / 2 and get Fig. 5.18(b). 

We eliminate the * operations in Fig. 5.18(b) by introducing two new 
vertices q 5 and g 6 and the A-moves as shown in Fig. 5.18(c), 

We eliminate concatenations and + in Fig. 5.18(c) and get Fig. 5.18(d). 
We eliminate the A-moves in Fig. 5.18(d) and get Fig. 5.18(e) which 
gives the NDFA equivalent to the given i.e. 
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Fig, 5,18 Construction of finite automaton equivalent to (0 + 1)*(00 + 11)(0 + 1)*. 

Step 2 (Construction of DFA) We construct the transition table for the 
NDFA defined by Table 5.3. 


TABLE 5.3 Transition Table for Example 5.13 


State/1 

0 

1 

9o 

9o> 9s 

9o> 94 

93 

9f 


94 


9 f 

® 

9f 

Qf 
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The successor table is constructed as given in Table 5.4. 

TABLE 5.4 Transition Table for the DFA of Example 5.13. 

Q 

Qo 

Qi 

—> iQo) 

too, < 73 ] 

too, < 74 ] 

too, <7s] 

too. < 73 , Qf] 

too. < 74 ] 

fob. <74] 

too, < 73 ] 

too, <74. Qf] 

[<7o. <73, <7 f] 

too. < 73 , Qf] 

too, < 74 , Qf] 

too. < 74 . <?,.] 

[Qo, <73, Qt] 

[Qo, <74, Qf] 


The state diagram for the successor table is the required DFA as described by 
Fig. 5.19. As q f is the only final state of NDFA, [q& q^, q f ] and [q 0 , q 4 , q f ] 
are the final states of DFA. 



Finally, we try' to reduce the number of states. (This is possible when two 
rows are identical in the successor table.) As the rows corresponding to 
[<?<> <??• q/] and [q 0 , q± q f ] are identical, we identify them.. The state diagram 
for the equivalent automaton, where the number of states is reduced, is 
described by Fig. 5.20. 



Fig. 5.20 Reduced finite automaton of Example 5.13. 


Note: While constructing the transition graph equivalent to a given r.e., the 
operation (concatenation, *, +) that is eliminated first, depends on the regular 
expression. 
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EXAMPLE 5,14 


Constmct a DFA with reduced states equivalent to the r.e. 10 + (0 + 11))0*1. 

Solution 

Step 1 (Construction of NDFA) The NDFA is constructed by eliminating the 
operation +. concatenation and *. and the A-moves in successive steps. The 
step-by-step construction is given in Figs. 5.21(a)-5.21(e). 



(b) Elimination of +. 

(V) 



(c) Elimination of concatenation and *. 



(e) Elimination of A-moves. 

Fig. 5.21 Construction of finite automaton for Example 5.14. 
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Step 2 (Construction of DFA) For the NDFA given in Fig. 5.18(e). the 
corresponding transition table is defined by Table 5.5. 


TABLE 5.5 Transition Table for Example 5.14 


State/1 

0 

1 

Qo 

93 

9i. 92 

9i 

Qf 


92 


93 

93 

JD 

93 

Qf 

The successor table is constructed and given 

in Table 5.6. 

In Table 5.6 the columns corresponding to [q t ] and 0 are identical. 

identify [q t ] and 0. 



TABLE 5.6 Transition Table of DFA for Example 5.14 

Q 

Qo 

Qi 

-» [<*j 

fed 

[Qi. Q 2 ] 


tool 

[Qf] 

[«7l %] 

[Qf] 

foal 

@ 

0 

0 

0 

0 

0 


The DFA with the reduced number of states corresponding to Table 5.6 
is defined by Fig. 5.22. 



Fig. 5.22 Reduced DFA of Example 5.14. 


5.2.6 Equivalence of Two Finite Automata 

Two finite automata over Z are equivalent if they accept the same set of strings 
over Z. When the two finite automata are not equivalent, there is some string 
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w over Z satisfying the following: One automaton reaches a final state on 
application of w t whereas the other automaton reaches a nonfinal state. 

We give below a method, called the comparison method . to test the 
equivalence of two finite automata over Z. 

Comparison Method 

Let M and M' be two finite automata over Z. We construct a comparison table 
consisting of n + I columns, where n is the number of input symbols. The first 
column consists of pairs of vertices of the form {q, q'), where q € M and q 
s M'. If {q, q) appears in some row of the first column, then the 
corresponding entry in the ^-column [a e Z) is (q a , q«), where q a and q' a are 
reachable from q and q, respectively on application of a (i.e, by a-paths). 

The comparison table is constructed by starting with the pair of initial 
vertices q in , q'- n of M and M' in the first column. The first elements in the 
subsequent columns are (q a , q' a ), where q a and q' u are reachable by a-paths 
from q m and q[ lv We repeat the construction by considering the pairs in the 
second and subsequent columns which are not in the first column. 

The row-wise construction is repeated. There are two cases: 

Case 1 If we reach a pair (q, cf) such that q is a final state of M. and q is 
a non final state of M' or vice versa, we terminate the construction and 
conclude that M and M' are not equivalent. 

Case 2 Here the construction is terminated when no new element appears in 
the second and subsequent columns w 7 hich are not in the first column (i.e. 
when all the elements in the second and subsequent columns appear in the first 
column). In this case we conclude that M and M' are equivalent. 


EXAMPLE 5.15 


Consider the following two DFAs M and M' over {0, 1} given in Fig. 5.23. 
Determine whether M and M' are equivalent. 
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Solution 

The initial states in M and M' are and q± respectively. Hence the first 
element of the first column in the comparison table must be (q h q 4 ). The first 
element in the second column is (q h q 4 ) since both q { and q 4 are oreachable 
from the respective initial states. The complete table is given in Table 5.7. 


TABLE 5.7 Comparison Tabie for Example 5.15 


ft, o') 

ft- <7c) 

fta. Qd) 

fti, < 7 4 ) 

fti, < 74 ) 

(<72, <7s) 

(Ster <fe) 

fts, <7e) 

fti. 94) 

(%.■ Vs) 

(Q 2 , q-) 

(%, Qe) 

(<?2- <fr) 

fts- %) 

fti, <74) 


As we do not get a pair (q, q), where q is a final state and q is a nonfinal 
state (or vice versa) at ever}' row. we proceed until all the elements in the 
second and third columns are also in the first column. Therefore. M and M' 
are equivalent. 



Show that the automata M\ and M 2 defined by Fig. 5.24 are not equivalent. 


c c 



Fig. 5.24 (a) Automaton M i and (b) automaton M 2 . 


Solution 

The initial states in M\ and M 2 are q { and r/ 4 , respectively. Hence the first 
column in the comparison table is (q b q 4 ). q 2 and q 5 are cAreachable from q x 
and c? 4 . We see from the comparison table given in Table 5.8 that q { and r/ 6 
arc ^-reachable from q 2 and q^ respectively. As r/] is a final state in and 
r/ 6 is a nonfinal state in M 2 , we see that Mi and M 2 are not equivalent: we can 
also note that q x is r/VAreachable from q h and hence del is accepted by M { , dd 
is not accepted by M 2 as only r/ 6 is rM-reachable from q± but q 6 is nonfinal. 
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TABLE 5.8 

Comparison Table for Example 5.16 

(Q. <0 

(<7c> <7 o) 

(<?<*. <7d) 

(<7i, Qa) 

(<7l- Qa) 

(<72. < 75 ) 

(<72. <7s) 

(<73. <7?) 

(<7i. Qe) 


5.2.7 Equivalence of Two Regular Expressions 

Suppose we are interested in testing the equivalence of two regular 
expressions, say P and Q. The regular expressions P and Q are equivalent iff 
they represent the same set. Also, P and Q are equivalent iff the corresponding 
finite automata are equivalent. 

To prove the equivalence of P and Q, (i) we prove that the sets P and Q 
are the same. (For nonequivalence we find a string in one set but not in the 
other.) Or (ii) we use the identities to prove the equivalence of P and Q. Or 
(iii) we construct the corresponding FA M and M' and prove that M and M'are 
equivalent. (For nonequivalence we prove that M and M' are not equivalent.) 

The method to be chosen depends on the problem. 


EXAMPLE 5.17 


Prove (a + b)* = a*(ba*)*. 

Solution 

Let P and Q denote (a + b)* and a*(ba*)*, respectively. Using the construction 
in Section 5.2.5, P is given by the transition system depicted in Fig. 5.25. 

Fig. 5.25 Transition system for (a + b)* 

The transition system for Q is depicted in Fig. 5.26. 

It should be noted that Figs. 5.25 and 5.26 are obtained after eliminating 
A-moves. As these two transition diagrams are the same, we conclude that 

p = Q 

We now summarize all the results and constructions given in this section. 

(i) Every r.e. is recognized by a transition system (Theorem 5.2). 

(ii) A transition system M can be converted into a finite automaton 
accepting the same set as M (Section 5.2.3). 

(iii) Any set accepted by finite automaton is represented by an r.e. 
(Theorem 5.3). 

(iv) A set accepted by a transition system is represented by an r.e. (from 
(ii) and (iii)). 
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Fig. 5.26 Transition system for a*(ba*)*. 

(v) To get the r.e. representing a set accepted by a transition system, we 
can apply the algebraic method using the Arden's theorem (see 
Section 5.2.4). 

(vi) If P is an r.e. r then to construct a finite automaton accepting the set 
P. we can apply the construction given in Section 5.2.5. 

(vii) A subset L of 1* is a regular set (or represented by an r.e.) iff it is 
accepted by an FA (from (i). (ii) and (iii)). 

(viii) A subset L of E* is a regular set iff it is recognized by a transition 
system (from (i) and (iv)). 

(ix) The capabilities of finite automaton and transition systems are the 
same as far as acceptability of subsets of strings is concerned. 

(x) To test the equivalence of two DFAs. we can apply the comparison 
method given in Section 5.2.6. 

We conclude this section with the Kleene's theorem. 

Theorem 5.4 (Kleene’s theorem) The class of regular sets over E is the 
smallest class 72 containing {a} for every' a e E and closed under union, 
concatenation and closure. 

Proof The set {a} is represented by the regular expression a. So {a} is 
regular for every a e E. As the class of regular sets is closed under union, 
concatenation, and closure. 72 is contained in the class of regular sets. 
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Let L be a regular set. Then L = T(M) for some DFA, M - ({q h 
q m ), X, <5 q Q , F). By Theorem 5.3. 

L = U P\f 

j =i ; 

where F = {cy^ . . . q t } and F[ l f. is obtained by applying union, concatenation 
and closure to singletons in X. Thus, L is in 1Z . I 

5.3 PUMPING LEMMA FOR REGULAR SETS 

In this section we give a necessary condition for an input string to belong to 
a regular set. The result is called pumping lemma as it gives a method of 
pumping (generating) many input strings from a given string. As pumping 
lemma gives a necessary condition, it can be used to show that certain sets are 
not regular. 

Theorem 5.5 (Pumping Lemma) Let M = (Q, X, <5, q 0> F ) be a finite 
automaton with n states. Let L be the regular set accepted by M . Let w e L 
and | w | > m. If m > n, then there exists x, y, z such that w - xyz, y F A and 
xy l z 6 L for each i > 0. 

Proof Let 

W = a j«2 • ■ ■ a m' m - n 

S(q 0 , a,a 2 . .. a ; ) = cy, for / = 1, 2. .. m; Qi = {<y 0 . • ■ •> fel 

That is, Qi is the sequence of states in the path with path value w = a x a 2 . . . a m . 
As there are only n distinct states, at least two states in Q { must coincide. 
Among the various pairs of repeated states, we take the first pair. Let us take 
them as qj and q k (qj = q k ). Then j and k satisfy the condition 0 < j < k < n. 

The string vv can be decomposed into three substrings a x a 2 . . . aj, aj +1 . . . 
a k and a k+l . . . a m . Let x, y t z denote these strings a^a 2 ... a- p a /+] . . . a k , 
a k+ 1 . . . a m , respectively. As k < n, \xy\ < n and w = xyz. The path with the 
path value w in the transition diagram of M is shown in Fig. 5.27. 

The automaton M starts from the initial state q 0 . On applying the string 
a; it reaches qf=q k ). On applying the string y, it comes back to qf - q k ). So 
after application of y l for each i > 0, the automaton is in the same state q y 
On applying c. it reaches q mt a final state. Hence. xy*z 6 L. As every state in 
Q x is obtained by applying an input symbol, y ^ A. I 



Fig. 5.27 String accepted by M. 
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Note: The decomposition is valid only for strings of length greater than or 
equal to the number of states. For such a string w - at z, we can ‘iterate’ the 
substring y in xyz as many times as we like and get strings of the form xy l z 
which are longer than xyz and are in L By considering the path from q 0 to 
q k and then the path from q k to q rn (without going through the loop), we get 
a path ending in a final state with path value xz. (This corresponds to the case 
when i - 0.) 


5.4 APPLICATION OF PUMPING LEMMA 

This theorem can be used to prove that certain sets are not regular. We now 
give the steps needed for proving that a given set is not regular. 

Step 1 Assume that L is regular. Let n be the number of states in the 

corresponding finite automaton. 

Step 2 Choose a string vr such that j w \ > n . Use pumping lemma to write 
\v = xyz, with | at j < n and | y | > 0. 

Step 3 Find a suitable integer i such that xy'z £ L. This contradicts our 

assumption. Hence L is not regular. 

Note: The crucial part of the procedure is to find i such that xy*z & L In 
some cases we prove \y l z £ L by considering | xy ! z |. In some cases we may 
have to use the ‘structure’ of strings in L 


EXAMPLE 5.18 


O . 

Show' that the set L = {a l ~ j i> 1} is not regular. 


Solution 


Step 1 Suppose L is regular. Let n be the number of states in the finite 
automaton accepting L 

Step 2 Let vv = d v . Then \ w\ = n 2 > n. By pumping lemma, we can write 
w = xyz with |at| < n and jv| > 0. 

Step 3 Consider at 2 :;. |.rv ,: c| = |a'| + 2|y| + \z\ > |a| + |y| + \z\ as 
| v j > 0. This means n 2 = \xyz \ = |a| + j y\ + \z\ < \ xy~z |. As \xy\ < n, 
we have | y j < n. Therefore. 


\xy~z\ = I x | + 2|yj + \z\ < n 2 + n 
i.e. 

n~ < | at“c |<w + ;?< ir + n + n + 1 

Hence, | xy~z | strictly lies between n 2 and (n + l) 2 but is not equal to any 
one of them. Thus j ay 2 :; | is not a perfect square and so at 2 - g L. But by 
pumping lemma, av 2 - e L. This is a contradiction. 
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EXAMPLE 5.19 


Show that L - {cf \ p is a prime} is not regular. 

Solution 

Step 1 We suppose L is regular. Let n be the number of states in the finite 
automaton accepting L. 

Step 2 Let p be a prime number greater than n. Let w = af. By pumping 
lemma, vr can be written as w - xyz, with | xy | < n and \ y \ > 0. a, y t z are 
simply strings of cf s. So, y - d n for some m > 1 (and < ri). 

Step 3 Let i = p + 1. Then | xy l z | = | xyz | + |y M ] =/? + (/- 1 )m - p + 
pm. By pumping lemma, x\' l z £ L But | x\ J z | = p + pm = p( 1 + m), and p(! 
+ m) is not a pnme. So atL g L. This is a contradiction. Thus L is not regular. 


EXAMPLE 5.20 


Show that L = {0T|/ > 1} is not regular. 

Solution 

Step 1 Suppose L is regular. Let n be the number of states in the finite 
automaton accepting L. 

Step 2 Let w = 0"1". Then |w| = 2n > m By pumping lemma, we write 
w = atc with |at| < « and |v| ^ 0. 

Step 3 We want to find i so that xy*z £ L for getting a contradiction. The 
string y can be in any of the following forms: 

Case 1 y has 0 ! s. i.e. v - 0 k for some k > 1. 

Case 2 y has only Ls, i.e. v = l l for some 1 > 1. 

Case 3 y has both O's and Ls, i.e. y = 0 k V for some k, j > 1. 

In Case L we can take i = 0. As xyz = 0 /? l' 2 , xz - O^l”. As k > 1, n - 
k ^ n. So, xz £ L. 

In Case 2, take i - 0. As before, xz is and n -£ n - l So. at & L. 

In Case 3. take i = 2. As xyz = 0 n - k 0 k l J r~k xrz = 0 n ~ k 0T¥lT /? T As at 2 - 
is not of the form 0 r L, xy 2 z £ L. 

Thus in all the cases we get a contradiction. Therefore, L is not regular. 


EXAMPLE 5.21 


Sluwv that L = {\y\v \ vv e {a, b}*} is not regular. 

Solution 

Step 1 Suppose L is regular. Let n be the number of states in the automaton 
M accepting L. 
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Step 2 Let us consider \vw = a u ba n h in L. | wvv | = 2(n + 1) > n. We can 
apply pumping lemma to write mv - xyz with | y | ^ 0. ] xy j < n. 

Step 3 We want to find i so that xy l z £ L for getting a contradiction. The 
string y can be in only one of the following forms: 

Case 1 y has no ms, i.e. y = a k for some k > 1. 

Case 2 y has only one b. 

We may note that y cannot have two V s. If so, |y| > n + 2. But \y\ < 
| at | < n. In Case 1, we can take / = 0. Then xy°z = xz is of the form d n ba n b, 
where m = n — k < n (or a ll ba tn b). We cannot write xz in the form uu with 
it e { a , £>}*. and so xz £ L In Case 2 too, we can take i - 0. Then xy°z =xz 
has only one b (as one b is removed from xyz* b being in y). So xz € L as 
any element in L should have an even number of a 's and an even number of 
fe’s. 

Thus in both the cases we get a contradiction. Therefore. L is not regular. 

Note: If a set L of strings over I is given and if we have to test whether 
L is regular or not, we try to write a regular expression representing L using 
the definition of L If this is not possible, we use pumping lemma to prove 
that L is not regular. 


EXAMPLE 5.22 


Is L = { cr n | n > 1} regular? 

Solution 

We can write a 2}1 as a(crfa } where i > 0. Now {(a 2 ) 2 1 i > 0} is simply {rr}*. 
So L is represented by the regular expression a(P)*a, where P represents {a~). 
The corresponding finite automaton (using the construction given in Section 
5.2,5) is shown in Fig. 5.28. 



5 = 5 CLOSURE PROPERTIES OF REGULAR SETS 

In this section we discuss the closure properties of regular sets under (i) set 
union, (ii) concatenation, (iii) closure (iteration), (iv) transpose, (v) set 
intersection, and (vi) complementation. 
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In Section 5.1. we have seen that the class of regular sets is closed under 
union, concatenation and closure. 

Theorem 5.6 If L is regular then lJ is also regular. 

Proof As L is regular by (vii). given at the end of Section 5.2.7. we can 
construct a finite automaton M = (ft Z, <5, q 0 , F) such that T(M) - L. 

We construct a transition system M' by starting with the state diagram of 
M, and reversing the direction of the directed edges. The set of initial states 
of M e is defined as the set F, and q 0 is defined as the (only) final state of M\ 
i.e. M' = (ft Z, 8\ F. {q 0 }). 

If w e T(M ), we have a path from q 0 , to some final state in F with path 
value n\ By ‘reversing the edges’, we get a path in M' from some final state 
in F to q Q . Its path value is \\ J . So w l e T(M'). In a similar way, we can 
see that if e T(M'), then wf e T(M). Thus from the state diagram it is 
easy to see that T(M') - T(M) 1 , We can prove rigorously that w e T(M) iff 
m t € T(M f ) by induction on jwj. So T(M) t = T(M'). By (viii) of Section 
5.2.7. T(M') is regular, i.e. T(M) r is regular. I 


EXAMPLE 5.23 


Consider the FA M given by Fig. 5.29. What is T(M)? Show that T(M) r is 
regular. 



Solution 


As the elements of T\M) are given by path values of paths from q 0 to itself 
or from q 0 to q\ (note that we have two final states q§ and q { ), we can 
construct T(M) by inspection. 

As arrows do not come into q (h the paths from q 0 to itself are self-loops 
repeated any number of times. The corresponding path values are 0‘ ; / > 1. 
As no arrow comes from q 2 to q 0 or q b the paths from q$ to q\ are of the 
form cjq ... —> qcj . . . q\ ... —> q j. The corresponding path values are OT 7 , 
where i > 0 and j > 1. As the initial state q§ is also a final state, A € T(iVf). 
Thus. 


T(M) = {OT- 7 | f. j > 0} 


T(M) f = {F0M7 j > 0} 


Hence, 



Chapter 5: Regular Sets and Regular Grammars H 167 


The transition system M' is constructed as follows: 

(i) The initial states of MY are q Q and q- v 

(ii) The (only) final state of M' is q 0 . 

(iii) The direction of the directed edges is reversed. AT is given in Fig. 5.30. 
From (i)—(iii) it follows that 

T(M') = T(Mf 

Hence, T(M) r is regular. 



Note: In Example 5.23. we can see by inspection that T(M') - (V 0 ! | i, 
j > 0}. The strings of T(M') are obtained as path values of paths from q 0 to 
itself or from q\ to q§. 

Theorem 5.7 If L is a regular set over Z, then Z* - L is also regular over Z. 

Proof As L is regular by (vii), given at the end of Section 5.2.7, we can 
construct a DFA M - (Q, Z. 8. q$, F) accepting L i.e. L = T(M). 

We construct another DFA M' = ( Q , Z. 5, q 0 , F') by defining F' = Q - F, 
i.e. M and M' differ only in their final states. A final state of M f is a nonfinal 
state of M and vice versa. The state diagrams of M and M f are the same except 
for the final states. 

H- e T(M') if and only if 8 (r/ 0 , \v) e F' = Q - F, i.e. iff w g? L This 
proves T(M') - Z* - X. 1 

Theorem 5.8 If X and Y are regular sets over Z, then X n Y is also regular 
over Z. 

Proof By DeMorgan's law for sets, X n Y = Z* - ((Z^ ~ X) u (Z* ~ l 7 ))- By 
Theorem 5.7. Z* - X and Z* - Y are regular. So, (Z* - X) u (Z^ - 7) is 
also regular. By applying Theorem 5.7, once again Z* - ((Z* - X) u 
(Z* - F)) is regular, i.e. X n Y is regular. I 

5.6 REGULAR SETS AND REGULAR GRAMMARS 

We have seen that regular sets are precisely those accepted by DFA. In this 
section we show that the class of regular'sets over Z is precisely the regular 
languages over the terminal set Z. 



168 a Theory of Computer Science 


5.6.1 Construction of a Regular Grammar 
Generating T(M) for a Given DFA M 

Let M = ({g 0 , ... . q n \, Z> 5, < 7 0 , ^)- If w is in 7TM), then it is obtained by 
concatenating the labels con'esponding to several transitions, the first from q 0 
and the last terminating at some final state. So for the grammar G to be 
constructed, productions should correspond to transitions. Also, there should 
be provision for terminating the derivation once a transition terminating at 
some final state is encountered. With these ideas in mind, we construct G as 

G — ({Aq, A], .. A u }, Z, P, Aq) 

where P is defined by the following rules: 

(i) Aj —» aAj is included in P if Siq^ a) = qj £ F. 

(ii) Aj —> aAj and A/ —> a are included in P if 8(q h a) = q; e F. 

We can show that L(G ) = T(M) by using the construction of P. Such a 
construction gives 

Aj=> aAj iff 8(q h a) = qj 

Aj => a iff 8(q h a) e F 

So. 

Aq => ci\Ai => a { a 2 A 2 => .. . a\ . . . fljt-iAjt ^2 ♦ • • ^ 

iff <5(^o- «i) = «:) = <7:- • ■ •• fli) e ^ 

This proves that vr = a, ... a k e L(G) iff 5(< 70 , flj .. . r/ <: ) e F, i.e. iff 
w e T(M). 


EXAMPLE 5.24 


Construct a regular grammar G generating the regular set represented by 
P = a*b(a + b)*. 

Solution 

We construct the DFA corresponding to P using the construction given in 
Section 5.2.5. The construction is shown in Fig. 5.31. 


Fig. 5.31 DFA of Example 5.24, with A-moves. 

After eliminating the A-moves. we get the DFA straightaway, as shown in 
Fig. 5.32. 
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_6 

Fig. 5.32 DFA of Example 5.24, without A-moves. 

Let G ~ ({AO, A]}, {a, b }. P. A 0 ). where P is given by 
Aq —> gA. q , A q —> bA |. Aq —> b 
A ] —> gA ], A ] —> bA i , A i —> <7, A j —> b 

G is the required regular grammar. 




5.6.2 Construction of a Transition System m 
Accepting L(G) for a Given Regular 
Grammar G 

Let G - ({A 0 , Ai.A m }, X. P, A 0 ). We construct a transition system A/ 

whose (i) states correspond to variables, (ii) initial state corresponds to A 0 , 
and (in) transitions in M correspond to productions in P. As the last 
production applied in any derivation is of the form A i —» a, the corresponding 
transition terminates at a new state, and this is the unique final state. 

We define M as ({q 0 , . . .. q n , qf), X. <5, q 0 , {^}) where S is defined as 
follows: 

(i) Each production A; -a aAj induces a transition from q t to qj with 
label a. 

(ii) Each production A k —» a induces a transition from q k to q k with 
label a. 

From the construction it is easy to see that A 0 => a x A x => a x a 2 A 2 => ■ • • 
=> uq ... G n _\A n _\ =$ ai . . . a n is a derivation of a x a 2 ... iff there is a 
path in M starting from q 0 and terminating in with path value a x a 2 . . . a lv 
Therefore. L(G) = T(M\ 


EXAMPLE 5.25 


Let G - ({A 0 , A]}, { a . b } ? P, A 0 ). where P consists of A 0 oA 1? A t -» bA { , 
A i -A <7, A| —> M 0 . Construct a transition system M accepting L(G ). 

Solution 

Let Af = ({ q 0 , q x . ^}. {m fc}. <5. (/q, where <70 and correspond to Aq 

and A h respectively and q f - is the new (final) state introduced. A 0 —> ciA x 
induces a transition from r / 0 to with label a. Similarly. Aj -a bA x and 
A] bA 0 induce transitions from q x to q x with label b and from q\ to q$ with 



170 B Theory of Computer Science 


label b y respectively, A x —> a induces a transition from q x to qj with label a. 
M is given in Fig, 5.33. 



Qf 


Fig, 5.33 Transition system for Example 5.25. 


EXAMPLE 5.26 


If a regular grammar G is given by S aS | a, find M accepting L(G). 

Solution 

Let q Q correspond to S and q f be the new (final) state. M is given in 
Fig. 5.34. Symbolically. 

M = ({ q 0 , Qf}- {a}. 8, q 0 , [q f \) 


a 



Fig, 5.34 Transition system for Example 5.26. 

Note: If S —> A is in P , the corresponding transition is from q 0 to qf with 
label A. 

By using the construction given in Section 5.2.3, we can construct a DFA 
M accepting L(G) for a given regular grammar G. 


5.7 SUPPLEMENTARY EXAMPLES 


EXAMPLE 5.27 


Find a regular expression corresponding to each of the following subsets of 
{a. b }. 

(a) The set of all strings containing exactly 2<rs. 

(b) The set of all strings containing at least la's. 

(c) The set of all strings containing at most 2a's. 

(d) The set of all strings containing the substring aa. 
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Solution 

(a) b*ab*ab* 

(b) (a + b)*a(a + b)*a(a + b)* 

(c) b*ab*ab* + b*ab* 

(d) (a + b)*aa(a + b)* 


EXAMPLE 5.28 


Find a regular expression consisting of all strings over {a, b} starting with any 
number of a’s, followed by one or more b's. followed by one or more a’s, 
followed by a single b, followed by any number of a’s, followed by b and 
ending in any string of a’s and b's. 

Solution 

The r.e. is a*b b*a a*b(a + b)*. 


EXAMPLE 5.29 


Find the regular expression representing the set of all strings of the form 

(a) a m b n d 1 where m, n, p > 1 

(b) d n b z,l c ip where m, n, p > 1 

(c) a n ba 2m b 2 where m > 0, n > 1 

Solution 

(a) aa*bb*cc* 

(b) aa^(bb)(bb)^ccc(ccc)* 

(c) aa H! b(aa)*bb 


EXAMPLE 5.30 


Find the sets represented by the following regular expressions. 

(a) (a + b)*(aa + bb + ab + ba)* 

(b) (aa)* + (aaa)^ 

(c) (1 + 01 + 001)*(A + 0 + 00) 

(d) a + b(a + b)* 

Solution 

(a) The set of all strings having an odd number of symbols from {a, b}* 

(b) {x e {a}* j | x | is divisible by 2 or 3} 

(c) The set of all strings over {0, 1} having no substring of more than 
two adjacent 0‘s. 

(d) {a. b , ba , bb, baa, bab, bba. bbb, . . .} 
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Show that {w e { a , &}* | w contains an equal number of as and V s} is not 
regular. 


Solution 

We prove this by contradiction. Assume that L = T\M) for some DFA M with 
n states. Let w - d l b n e L and | wql = 2 n . Using the pumping lemma, we write 
w = xy>z with | xy | < n and | v j > 0. As xyz = d l bt\ xy = o* where i < /z and 
hence y = d for some j, 1 < j < n. Consider xy 2 z- Now xyz has an equal number 
of d s and b's. But xy 2 z has (n + j) a ’s and n b's. As n +/ * /?, £ L. 

This contradiction proves that L is not regular. 


EXAMPLE 5.32 


Show that L = {dtic k | k > i + /} is not regular. 

Solution 

We prove this by contradiction. Assume L = T(M) for some DFA with n 
states. Choose w = d l b n c* u in L, Using the pumping lemma, we write w = atz 
with | xy j < n and j v j > 0. As h = a n b n c' v \ xy = d for some i < n. This means 
that y - d for some j. 1 < j < n. Then xy' M z - d^ k b n e n . Choosing k large 
enough so that n + jk > 2n, w 7 e can make n + jk + n > 3n. So, xy* +i z € L. 
Hence L is not regular. 


EESEBBa 

Prove the identities / 5 . / 6 , / 7 , / 8 , 7 n / 12 given in Section 5.1.1. 

Proof L (R + R) = L(R) u L(R) = L(R). Hence / 5 . 

L(R*R*) = L(R*)L(R*) = {uqwqjwq, uq e L(R*)}. But nq = x^xq . . . 
x n e L( R)* for A'/ e L(R), i = 1. 2, . . ., m Similarly, viq = y L y 2 . . . y m e 
L(R*) for y,- e L(R). j = L 2. .... m. So >iqnq 6 L(R)*. proving / 6 . 

An element of L(RR*) is of the form xyj . . . y n for some 
x, V|. y n e L( R). 

As xy'i . . . y n = (xyq .. .y„_i)v„ € L(R + R), / 7 follows. 

It is easy to see that L(R*) c L((R*)*). Take w e L((R*)*). Then 
w = xi ... x,„ where x,- € R*, / = 1, 2. .. m. Each x 2 - in turn can be 
written in the form vqxs .. . y n for some y,- e L(R). i = L 2, . . .. /i. So, 
W = Ai . . . A m = Z\ ... Z k € L( R*). 

(Wore; Aq = ci ... c r; x 2 = c f +i ... z l+! etc.) 

Hence / 8 . 

To prove 7^, take m e L(P + Q)* Then w = uq . . . w n where \v t e 

L(P) u L(Q). By writing vq = = uy\, we note that e Hence 

w € L(P*Q*)*. To prove L(P*Q*)* c L(P 4- Q)*, take w € L(P*Q*)*. 
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Then, w = w, ... w„ where vv ( e P*Q*. For simplicity lake n = 2. (The result 
can be extended by induction for any n ). Then Wj = x l .\ 2 ■ ■ ■ x b where 
x, e L(P ) and us = y L v 2 ... » where _Vy e L( Q). So vr = x x x 2 . . . -X*>Tb '2 • • • TV- 
Each Xj or y ; is in L(P + Q). Hence w e L(P + Q)*, proving the first identity 
in I u . The second identity can be proved in a similar way. 

Finally, we prove I ]2 . 

L(( P + Q)R) = L(P + Q)L(R) 

= <L(P) v L(Q))L(R) 

L (PR + QR) = L(PR) u L(QR) 

= (L(P)L(R)) u (L(Q)L(R)) 

But (A u B)C = AC u BC for A, B, C c L*. For, a string w in 
(A u B)C is the concatenation of a string w, in A or B and a string w 2 in C. 
If wj e A, then 6 AC; if vr, e B. then u’jWi e BC. Hence w e AC u 
BC. The other inclusion can be proved similarly. / 12 follows from (A u B)C 
= AC u BC. 


EXAMPLE 5.34 


Prove that P + PQ*Q = a*bQ* where P = b + aa*b and Q is any regular 
expression. 

Proof L.H.S. = PA + PQ*Q 

= P(A + Q*Q) 

= PQ* 

= (b + aa*b)Q* 

= (Ab + aa^blQ^ 

= (A + aa*)bQ* 

= a*bQ* 

= R.H.S. 


by h 
by I l2 
by I g 

by definition of P 
by A 
by / 12 
by 1 9 


EXAMPLE 5.35 


Construct a regular grammar accepting L - {w g {a, b}* | w is a string over 
{a. b) such that the number of V s is 3 mod 4}. 

Solution 

We construct a DFA M accepting L directly. The symbol a can occur in any 
place in vr and b has to occur in 4 k + 3 places, where k > 0. So we can have 
states q h / = 0, 1, 2. 3. for remembering that the string processed so far has 
4k, 4k + 1. 4k + 2 and 4k + 3 b's (k > 0). q 3 is the only final state. Also M 
does not change state on reading a' s. The state diagram representing M is 
given in Fig. 5.35. 
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a a 



By applying the construction given in Section 5.6.1. we can construct a 
regular grammar G accepting L = T(M). 

G = ({A 0 , A h A 2 * A 3 }. { a . b }, P. A 0 ) where P consists of A 0 -> aA 0 , 
Aq —> fcAj, A| —> fcA], Ai —> &A 2 , A 2 —> &A 2 , A 2 —> bA 2 . A 2 —> A 3 —> ^ 3 , 

A3 — ^ nA(). 


EXAMPLE 5.36 


Let G = ({A 0 , A t , A : . A 3 }, {a, £>}, P, A 0 ), where P consists of A 0 -> 
aA 0 | bA h Aj -> «A 2 | aA 3 , A 3 -4 a \ bA { | M 3 , A 3 -» b \ bA 0 . Construct an 
NDFA accepting L(G). 

Solution 

The NDFA accepting L = L(G ) is M where M = ({r/ 0 > < 7 i> <h-. <? 4 )> b}, 

S. <?o, {<y 4 }) - 5 is described by the state diagram shown in Fig. 5.36. 


b 



Fig. 5.36 NDFA for Example 5.36. 
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SELF-TEST 


Choose the correct answer to Questions 1-10. 


1. The set of all strings over {a, b} of even length is represented by the 
regular expression 

(a) (ab + aa 4 bb 4 ba)* (b) (a + b)*(a* 4 b)* 

(c) (aa 4 bb)* (d) (ab 4 ba)* 


2. The set of all strings over {a , b } of length 4, starting with an a is 
represented by the regular expression 


(a) a(a + b)* 

(c) (ab 4 ba)(aa 4 bb) 

3. (0*1*)* is the same as 
(a) (0 + 1 )* 

(c) (10)* 


(b) a(ab)* 

(d) a(a + b)(a 4 b)(a 4 b) 

(b) (01)* 

(d) none of these. 


4. If L is the set of all strings over {a. b] containing at least one a , then 
it is not represented by the regular expression 

(a) b*a(a 4 b)* (b) (a 4 b)*a(b 4 a)* 

(c) (a + b)*ah* (d) (a 4 b)*a 

5 . {a 2n \ n > 1 } is represented by the regular expression 

(a) (aa)* (b) a* 

(c) aa*a (d) a*a* 


6. The set of strings over { a , b } having exactly 3fe ? s is represented by the 
regular expression 

(a) a*bbb (b) a*ba*ba*b 

(c) ba*ba*b (d) a*ba*ba*ba* 


7. The set of all strings over {a, 
represented by 

(a) a*ababb* 

(c) a*b*ababa*b* 

8. (a 4 a*)* is equivalent to 
(a) a(a*)* 

(c) aa* 

9. a*(a 4 b)* is equivalent to 
(a) a* 4 b* 

(c) a*b* 


b} having abab as a substring is 

(b) (a 4 b)*abab(a 4 b)* 

(d) (a 4 b)*abab 

(b) a* 

(d) none of these. 

(b) (ab)* 

(d) none of these. 


10. ab* 4 b* represents all strings w over { a . b} 

(a) starting with an a and having no other a's or having no a 's but 
only b 's 

(b) starting with an a followed by V s 

(c) having no a's but only V s 

(d) none of these. 



176 S Theory of Computer Science 


State whether the following Statements 11-17 are true or false. 

11. If I is finite then Z* is finite. 

12. .Every finite subset of E* is a regular language. 

13. Every regular language over E is finite. 

14. # 4 /r is in the regular set given by a*(a + b)b*. 

15. aa* 4- bb* is the same as (a + b)*. 

16. The set of all strings starting with an a and ending in ab is defined by 
the regular expression a(a + b)*b. 

17. The regular expression (a + b)*c* is the same as a*(b + c)*. 

EXERCISES 

5.1 Represent the following sets by regular expressions: 

(a) {0, 1. 2}. 

(b) {l i,+1 | it > 0}. 

(c) {vv € {a, b}* | w has only one a }. 

(d) The set of all strings over {0. 1} which has at most two zeros. 

(e) {a 2 , a-, a*, ...}. 

(f) {a n | n is divisible by 2 or 3 or n - 5}. 

(g) The set of all strings over { a , b} beginning and ending with a. 

5.2 Find all strings of length 5 or less in the regular set represented by the 
following regular expressions: 

(a) (ab + a)*(aa + b) 

(b) (a*b + b*a)*a 

(c) a* + (ab + a)* 

5.3 Describe, in the English language, the sets represented by the following 
regular expressions: 

(a) a(a + b)*ab 

(b) a*h + b*a 

(c) (aa 4- b)*(bb + a)* 

5.4 Prove the following identity: 

(a*ab + ba)*a* = (a + ab + ba)* 

5.5 Construct the transition systems equivalent to the regular expressions 
given in Exercise 5.2. 

5.6 Construct the transition systems equivalent to the regular expressions 
given in Exercise 5.3. 

5.7 Find the set of strings over L = {a, b } recognized by the transition 
systems shown in Fig. 5.37(a-d). 

5.8 Find the regular expression corresponding to the automaton given in 
Fig. 5.38. 
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5,9 Construct a transition system corresponding to the regular expressions 
(i) (ab + c*)*b and (ii) a + bb + bab*a. 

5.10 Find the regular expressions representing the following sets: 

(a) The set of all strings over {0, 1} having at most one pair of Cfs 
or at most one pair of Fs. 

(b) The set of all strings over {a, b} in which the number of 
occurrences of a is divisible by 3. 

(c) The set of all strings over {a, b} in which there are at least two 

occurrences of b between any two occurrences of a. 

(d) The set of all strings over {a, b} with three consecutive b' s. 

(e) The set of all strings over {0, 1} beginning with 00. 

(f) The set of all strings over {0. 1} ending with 00 and beginning 

with 1. 

5.11 Construct a deterministic finite automaton corresponding to the regular 
expression given in Exercise 5.2. 

5.12 Construct a finite automaton accepting all stnngs over {0. 1} ending in 
010 or 0010. 

5.13 Construct a finite automaton M which can recognize DFA in a given 

string over the alphabet {A, B, . . Z). For example, M has to 

recognize DFA in the string ATXDFAMN. 

5.14 Construct a finite automaton for the regular expression (a + b)*abb. 

5.15 Show that there exists no finite automaton accepting all palindromes 
over {a , b). 

5.16 Show that {d ] b n \ n > 0} is not a regular set without using the pumping 
lemma. 

5.17 Using the pumping lemma, show that the following sets are not regular: 

(a) {a n b 2n \n >0}; 

(b) {a n b m | 0 < n < m}. 

5.18 Show that {(FT* | g.c.d. (m, n) = 1} is not regular. 

5.19 Show that a deterministic finite automaton with n states accepting a 
nonempty set accepts a string of length m, m < n. 

5.20 Construct a finite automaton recognizing L{G), where G is the grammar 
5 —» aS | bA | b and A —> aA\ bS\ a . 

5.21 Find a regular grammar accepting the set recognized by the finite 
automaton given in Fig. 5.37(c). 

5.22 Construct a regular grammar which can generate the set of all strings 
starting with a letter (A to Z) followed by a string of letters or digits 
(0 to 9). 
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5.23 Are the following true or false? Support your answer by giving proofs 
or counter-examples. 

(a) If L x u L 2 is regular and L x is regular, then L 2 is regular. 

(b) If L X L 2 is regular and L x is regular, then L 2 is regular. 

(c) If L is regular, then L is regular. 

5.24 Construct a deterministic finite automaton equivalent to the grammar 
S —^ ciS j bS I aA , A —^ bB , B —^ aC, C —^ A. 




Context-Free Languages 


In this chapter we study context-free grammars and languages. We define 
derivation trees and give methods of simplifying context-free grammars. The 
two normal forms—Chomsky normal form and Greibach normal form—are 
dealt with. We conclude this chapter after proving pumping lemma and giving 
some decision algorithms. 

6.1 CONTEXT-FREE LANGUAGES AND DERIVATION 
TREES 

Context-free languages are applied in parser design. They are also useful for 
describing block structures in programming languages. It is easy to visualize 
derivations in context-free languages as we can represent derivations using tree 
structures. 

Let us recall the definition of a context-free, grammar (CFG). G is 
context-free if every production is of the form A —» or, where A € V N and 
a e (V N u Z)*. 


EXAMPLE 6.1 


Construct a context-free grammar G generating all integers (with sign). 

Solution 

Let 

G = (V N , I. P, S ) 


where 


V N = {S, (sign), (digit). (Integer)} 
1= {0, 1. 2, 3, .... 9. +, -} 
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P consists of S —» (sign) (integer), (sign) —» + | - 
(integer) -4 (digit) (integer) | (digit) 

(digit) -4 0 111 21... | 9 

L(G) = the set of all integers. For example, the derivation of -17 can be 
obtained as follows: 

S =$ (sign) (integer) =*> - (integer) 

=> - (digit) (integer) => - 1 (integer) => -1 (digit) 

=> - 17 


6.1.1 Derivation Trees 

The derivations in a CFG can be represented using trees. Such trees 
representing derivations are called derivation trees. We give below a rigorous 
definition of a derivation tree. 

Definition 6.1 A derivation tree (also called a parse tree) for a CFG 
G = (V v . X. P, 5) is a tree satisfying the following conditions: 

(i) Every 7 vertex has a label which is a variable or terminal or A. 

(ii) The root has label S. 

(iii) The label of an internal vertex is a variable. 

(iv) If the vertices w 1? n 2 * . . n k written with labels X\, X 2 , . . X k are 
the sons of vertex n with label A. then A —> X\X 2 . . . X k is a 
production in P. 

(v) A vertex n is a leaf if its label is a € X or A; n is the only son of 
its father if its label is A. 

For example, let G = ({5, A}, {a, b }. P, S). where P consists of S —> 
r/A5|a|S5. A —» SbA | bci. Figure 6.1 is an example of a derivation tree. 

S 
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Note: Vertices 4-6 are the sons of 3 written from the left, and 5 aAS is 
in P. Vertices 7 and 8 are the sons of 5 written from the left, and A -» ba 
is a production in P, The vertex 5 is an internal vertex and its label is A, which 
is a variable. 

Ordering of Leaves from the Left 

We can order all the vertices of a tree in the following way: The successors 
of the root (i.e. sons of the root) are ordered from the left by the definition 
(refer to Section 1.2). So the vertices at level 1 are ordered from the left. If 
vj and v 2 are any two vertices at level 1 and v\ is to the left of v 2 , then we 
say that vj is to the left of any son of v 2 . Also, any son of V\ is to the left 
of v 2 and to the left of any son of v 2 . Thus we get a left-to-right ordering of 
vertices at level 2. Repeating the process up to level k , where k is the height 
of the tree, we have an ordering of all vertices from the left. 

Our main interest is in the ordering of leaves. 

In Fig. 6.1, for example, the sons of the root are 2 and 3 ordered from the 
left. So, the son of 2, namely 10, is to the left of any son of 3. The sons of 
3 ordered from the left are 4-5-6. The vertices at level 2 in the left-to-right 
ordering are 10-4-5-6. The vertex 4 is to the left of 6. The sons of 5 ordered 
from the left are 7-8. So 4 is to the left of 7. Similarly, 8 is to the left of 
9. Thus the order of the leaves from the left is 10-4-7-8-9. 

Note: If we draw the sons of any vertex keeping in mind the left-to-right 
ordering, we get the left-to-right ordering of leaves by 'reading’ the leaves in 
the anticlockwise direction. 

Definition 6.2 The yield of a derivation tree is the concatenation of the 
labels of the leaves without repetition in the left-to-right ordering. 

The yield of the derivation tree of Fig. 6.1, for example, is aabaci. 

Note: Consider the derivation tree in Fig. 6.1. As the sons of 1 are 2-3 in 
the left-to-right ordering, by condition (iv) of Definition 6.1, we have the 
production S —» SS . By applying the condition (iv) to other vertices, we get 
the productions S — > a, S — > aAS , A —> ba and 5 —» a. Using these 
productions, we get the following derivation: 

S SS ==> as =$ aaAS aabaS => aabaa 

Thus the yield of the derivation tree is a sentential form in G. 

Definition 6.3 A subtree of a derivation tree T is a tree (i) whose root is 
some vertex v of T\ (ii) whose vertices are the descendants of v together with 
their labels, and (iii) whose edges are those connecting the descendants of v. 

Figures 6.2 and 6.3. for example, give two subtrees of the derivation tree 
shown in Fig. 6.1. 
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Fig. 6.2 A subtree of Fig. 6.1. Fig. 6.3 Another subtree of Fig. 6.1. 

Note: A subtree looks like a derivation tree except that the label of the root 
may not be S . It is called an A-tree if the label of its root is A. 

Remark When there is no need for numbering the vertices, we represent the 
vertices by points. The following theorem asserts that sentential forms in CFG 
G are precisely the yields of derivation trees for G . 

Theorem 6.1 Let G = (V Y , X, P. S ) be a CFG. Then S => a if and only if 
there is a derivation tree for G with yield a. 

Proof We prove that A ==> a if and only if there is an A-tree with yield a. 
Once this is proved, the theorem follows by assuming that A = S. 

Let a be the yield of an A-tree T. We prove that A A aby induction on 
the number of internal vertices in T. 

When the tree has only one internal vertex, the remaining vertices are 
leaves and are the sons of the root. This is illustrated in Fig. 6.4. 


A 



By condition (iv) of Definition 6.1. A —> ApL . . . A m = a is a production 
in G, i.e. A => a. Thus there is basis for induction. Now assume the result 
for all trees with at most k - 1 internal vertices (k > 1). 

Let T be an A-tree with k internal vertices (k > 2). Let v h v 2 , .... v m 
be die sons of the root in the left-to-right ordering. Let their labels be 
X h . . .. X m . By condition (iv) of Definition 6.1, A -» X^ 2 . . . X m is in 
P, and so 


A => X x X 2 - . . X m 


(6.1) 
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As k > 2, at least one of the sons is an internal vertex. By the left-to-right 
ordering of leaves, a can be written as Of] a 2 ... oy, where Of z is obtained by 
the concatenation of the labels of the leaves which are descendants of vertex 
V/. If v/ is an internal vertex, consider the subtree of T with v f as its root. The 
number of internal vertices of the subtree is less than k (as there are k internal 
vertices in T and at least one of them, viz. its root, is not in the subtree). So 
by induction hypothesis applied to the subtree, A) ==> Of;. If v,- is not an internal 
vertex, i.e. a leaf, then X = a v 

Using (6.1), we get 

A => X x X 2 ... X m =S> <y.\X 2 X 3 ... X m .=?> a x a 2 ... a,,, = or, 

i.e. A A a By the principle of induction, A => a whenever a is the yield 
of an A-tree. 

To prove the ‘only if’ part, let us assume that A i a, We have to 
construct an A-tree whose yield is Of. We do this by induction on the number 
of steps in A => Of. 

When A => Of, A —» Of is a production in P. If a - XfX 2 . .. X m , the 
A-tree with yield a is constructed and given as in Fig. 6.5. So there is basis 
for induction. Assume the result for derivations in at most k steps. Let 
A =4> a: we can split this as A =^> X\ ... X m *=> Of. Now, A => X x . . . X m 
implies A -» X x X 2 . . . X m is a production in F. In the derivation X x X 2 . . . X m 

Of, either (i) X } is not changed throughout the derivation, or (ii) X; is 
changed in some subsequent step. Let O'- be the substring of a derived from X r 
Then X: => 0 f z - in (ii) and X; = a t in (i). As G is context-free, in every step of 
the derivation X{X 2 .. . X m —> a, we replace a single variable by a string. As 
Of}, a 2 , . . 0 f„ ; , account for all the symbols in a, we have a = Gf^ . . . a m . 


A 



Fig. 6.5 Derivation tree for one-step derivation. 

We construct the derivation tree with yield a as follows: As A —» X { ... X m 
is in P . we construct a tree with m leaves whose labels are . . ., X m in the 
left-to-right ordering. This tree is given in Fig. 6.6. In (i) above, we leave the 
vertex as it is. In (ii). Xj => Of/ is less than k steps (as X l . . . X m ^ a). By 
induction hypothesis there exists an Aytree T i with yield a v We attach the tree 
T: at the vertex v,- (i.e. v } is the root of T } ). The resulting tree is given in 
Fig. 6.7. In this figure, let i and j be the first and the last indexes such that 
X: and Xj satisfy (ii). So, a x . . . 0 f M are the labels of leaves at level 1 in T. 
Of,- is the yield of the Aytree T r etc. 
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Fig. 6.7 Derivation tree with yield a\Oc 2 ... ct n > 

Thus we get a derivation tree with yield ol By the principle of induction 
we can get the result for any derivation. This completes the proof of 'only if' 
part. I 

Note: The derivation tree does not specify the order in which we apply the 
productions for getting ol So, the same derivation tree can induce several 
derivations. 

The following remark is used quite often in proofs and constructions 
involving CFGs. 

Remark If A derives a terminal string w and if the first step in the derivation 
is .4 =» A\Az • • ■ A n , then we can write w as w\w 2 . . . w n so that 
Aj ==> Wj. (Actually, in the derivation tree for n\ the /th son of the root has 
the label A h and is the yield of the subtree whose root is the ith son.) 


EXAMPLE 6.2 


Consider G whose productions are S —> ciAS | a, A —> SbA | SS | ba. Show that 
S => aabbaa and construct a derivation tree whose yield is aabbaa. 



186 B Theory of Computer Science 


Solution 

S => aAS asbAS => aabAS => orbbaS => crb^a 2 (6.2) 
Hence. S => ccb 2 cr. The derivation tree is given in Fig. 6.8. 



Fig. 6.8 The derivation tree with yield aabbaa for Example 6.2. 

Note: Consider G as given in Example 6.2. We have seen that S ==> a 2 b~a 2 , 
and (6.2) gives a derivation of a~b 2 a^. 

Another derivation of a~b~a~ is 

S => aAS => aAa aSbAci => aSbbaa =» aabbaa (6.3) 

Yet another derivation of a 2 b 2 a 2 is 

5 => aAS => aSbAS =» aSbAa => aabAa aabbaa (6.4) 

In derivation (6.2). whenever we replace a variable X using a production, 
there are no variables to the left of X. In derivation (6.3), there are no variables 
to the right of X. But in (6.4), no such conditions are satisfied. These lead to 
the following definitions. 

Definition 6.4 A derivation A => w is called a leftmost derivation if we 
apply a production only to the leftmost variable at every step. 

Definition 6.5 A derivation A ==> w is a rightmost derivation if we apply 
production to the rightmost variable at every step. 

Relation (6.2). for example, is a leftmost derivation. Relation (6.3) is a 
rightmost derivation. But (6,4) is neither leftmost nor rightmost. In the second 
step of (6.4). the rightmost variable 5 is not replaced. So (6.4) is not a 
rightmost derivation. In the fourth step, the leftmost variable S is not replaced. 
So (6.4) is not a leftmost derivation. 

Theorem 6.2 If A => \v in G, then there is a leftmost derivation of w. 
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Proof We prove the result for every A in V N by induction on the number 
of steps in A vc. A => w is a leftmost derivation as the L.H.S. has only 
one variable. So there is basis for induction. Let us assume the result for 

derivations in atmost k steps. Let A w. The derivation can be split as 
A => . .. X m 4- w. 

The string w can be split as w { w 2 . . . w m such that X { =» (see the 
Remark appended before Example 6.2). As X, => involves atmost k steps 
by induction hypothesis, we can find a leftmost derivation of Using these 
leftmost derivations, we get a leftmost derivation of w given by 

A X\X 2 . . . X m => WiX 2 . . . X m =$ w\w 2 X 2 . . . X m ... => WjH ’2 . . . w m 
Hence by induction the result is true for all derivations A w. I 

Corollary Even 7 derivation tree of w induces a leftmost derivation of w. 

Once we get some derivation of vt\ it is easy to get a leftmost derivation of 
w in the following way: From the derivation tree for w, at every level consider 
the productions for the variables at that level, taken in the left-to-right ordering. 
The leftmost derivation is obtained by applying the productions in this order. 


EXAMPLE 6.3 


Let G be the grammar S —> OB 11A. A —» 0 j 05 j IAA, B —> 111510 BB. For 
the string 00110101, find (a) the leftmost derivation, (b) the rightmost 
derivation, and (c) the derivation tree. 

Solution 

(a) 5 => 05 => 0055 =» 0015 => 00115 

^ 0 2 1 2 05 => 0 2 1 2 015 => 0 2 1 2 0105 => 0 2 1 2 0101 

(b) 5 => 05 0055 => 00515 => 006105 

=> 0 2 51015 => 0 2 610105 => 0 2 510101 => 0 2 110101. 

(c) The derivation tree is given in Fig. 6.9. 



Fig. 6.9 The derivation tree with yieid 00110101 for Example 6.3. 
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6.2 AMBIGUITY IN CONTEXT-FREE GRAMMARS 

Sometimes we come across ambiguous sentences in the language we are using. 
Consider the following sentence in English: “In books selected information is 
given.” The word ‘selected' may refer to books or information. So the sentence 
may be parsed in two different ways. The same situation may arise in context- 
free languages. The same terminal string may be the yield of two derivation 
trees. So there may be two different leftmost derivations of w by Theorem 6.2. 
This leads to the definition of ambiguous sentences in a context-free language. 

Definition 6.6 A terminal string w e L(G ) is ambiguous if there exist two 
or more derivation trees for w (or there exist two or more leftmost derivations 
of w). 

Consider, for example, G = ({*S}, {a, b, +, *}, P, S), where P consists 
of S S + S\S * S\a\b. We have two derivation trees for a + a * b given 
in Fig. 6.10. 




Fig. 6.10 Two derivation trees for a + a * b. 

The leftmost derivations of a + a * b induced by the two derivation trees 
are 

S=$S + S=$a + S=$a + S*S=$a + a*S^a + a*b 
S=$S*S^>S + S*S=$a + S*S=>a + a*S=^a + a*b 
Therefore, a + a * b is ambiguous. 

Definition 6.7 A context-free grammar G is ambiguous if there exists some 
w € L(G), which is ambiguous. 


EXAMPLE 6.4 


If G is the grammar S —» SbS\a, show that G is ambiguous. 

Solution 

To prove that G is ambiguous, w r e have to find a w e L(G), which is 
ambiguous. Consider w = cibababa 6 L(G). Then we get two derivation trees 
for w (see Fig. 6.11). Thus. G is ambiguous. 



Chapter 6: Context-Free Languages B 189 



S 



Fig. 6.11 Two derivation trees of abababa for Example 6.4. 

6.3 SIMPLIFICATION OF CONTEXT-FREE GRAMMARS 

In a CFG G, it may not be necessary to use all the symbols in V N u 2, or 
all the productions in P for deriving sentences. So when we study a context- 
free language L(G). we try 7 to eliminate those symbols and productions in G 
w r hich are not useful for the derivation of sentences. 

Consider, for example. 

G = ({5. A. B, C, E], { a , B, c}, P, 5) 

where 

P = {5 —^ AB, A —^ B —^ B, B —^ G, B ^ e|A} 

It is easy to see that L(G) = {ab}. Let G' = ({S, A. B}, {a, B}, P\ 5), where 
P' consists of S —> AB, A a, B b. L(G) = L(G r ). We have eliminated 
the symbols C, E and c and the productions B —» C. £ —» c | A. We note the 
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following points regarding the symbols and productions which are eliminated: 

(i) C does not derive any terminal string. 

(ii) E and c do not appear in any sentential form. 

(iii) E —> A is a null production. 

(iv) B —> C simply replaces B by C. 

In this section, we give the construction to eliminate (i) variables not 
deriving terminal strings, (ii) symbols not appearing in any sentential form, 
(iii) null productions, and (iv) productions of the form A -4 B. 


6.3.1 Construction of Reduced Grammars 

Theorem 6.3 If G is a CFG such that L(G) * 0, we can find an equivalent 
grammar G such that each variable in G' derives some terminal string. 

Proof Let G = (V v , X, P, 5). We define G' = (F' v , X, G\ S ) as follows: 

(a) Construction of V' N : 

We define c V N by recursion: 

W { = {A e V( v | there exists a production A —> w where w e X*}. (If 
W\ ~ 0. some variable will remain after the application of any production, and 
so L(G) = 0.) 

W i+ i = W f u {A e V N \there exists some production A —> a 

with a e (X u Wf)*} 

By the definition of W h Wj c W i+] for all 1 As V N has only a finite number 
of variables, W k = W k+{ for some k < \ V N [ Therefore, W k = W k+ j for j > 1. 
We define V' N = W k . 

(b) Construction of P': 

P' = {A -> a|A, a e (V^ u X)*} 

We can define G' = (V'v, X, Ph 5). 5 is in F v . (We are going to prove that 
ever}' variable in V N derives some terminal string. So if S <£ V N , L(G) = 0. 
But L(G) * 0.) 

Before proving that G / is the required grammar, we apply the construction 
to an example. 


EXAMPLE 6.5 


Let G = (V>v, X. P, S ) be given by the productions 5 —> AB , A —> a, B b, 
B —> C. E —» c. Find G' such that every 7 variable in G / derives some terminal 
string. 

Solution 

(a) Construction of V' N : 

Wi = {A, P. £} since A -¥ afB b, E c are productions with a 
terminal string on the R.H.S. 
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W 2 = W\ u {Ai e V N \ A] —> a for some a e (X u {A, £, £})*} 
= W] u {5} = {A, 5, E, 5} 

= W 2 u {Aj e V n \A\ -» a for some a e (X u {5* A, 5, £})*} 
= W 2 u $ = W 2 
Therefore, 

Kv = A, B, F} 

(b) Construction of P 

P' - [A { -> a|A b a e (V'V u I)*} 

~ (5 —> AB, A —> g, B —> b, E —> c} 

Therefore, 

G' = ({5, A, B, £}, {a, b. c}, P\ 5) 

Now we prove: 

* 

(i) If each A 6 V% then A =S> vv for some vv e X*; conversely, if A ^ vv, 
then A e V N . 

(ii) KG') = KO- 

To prove (i) we note that W k = W { u W 2 . . . u W*. We prove by 

induction on i that for /.= 1, 2. K A € W/ implies A ==> w for some 

G' 

vv € P. If A 6 W u then A => w. So the production A —> w is in P'. 
Therefore. A => u*. Thus there is basis for induction. Let us assume the result 

G’ 

for L Let A e W i+{ . Then either A e W h in which case. A => vv for some 

G' 

iv e X* by induction hypothesis. Or, there exists a production A —» a with 
Of e (Xu vv 2 )*. By definition of P\ A —> a is in P', We can write 
Of = X|Z 2 . . . X m - where Xj e X u W}. If Xj e W/ by induction hypothesis, 

Xj => vv,- for some w,- e X*. So. A ==> wpvj . . . vv,„ e X* (when X; is a terminal, 
G* G' 

wj = X;). By induction the result is true for i = 1. 2, . . k. 

The converse part can be proved in a similar w r ay by induction on the 

number of steps in the derivation A => vv. We see immediately that KG') c 

G 

KG) as V' N c V N and P' c P. To prove L(G) c L(G0, we need an auxiliary 
result 

A =b> w if A => vv for some w 6 X* (6.5) 

G' g 

We prove (6.5) bv induction on the number of steps in the derivation A => w. 

G 

If A => vv. then A —> vv is in P and A € W\ c As A e V# and vv ’ £ X*, 

G 

A —» vv is in P'. So A => w. and there is basis for induction. Assume (6.5) 

G' 

k+\ 

for derivations in at most k steps. Let A =^> vv. By Remark appealing after 

G 
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Theorem 6.1, we can split this as A => X t X-> . . . X,„ => w l w 2 ■ ■ ■ w m such 

G G 

that Xj => wj If Xj e L. then Wj = Xj. 

G * 

If Xj e V N then by (i), Xj e V' N . As Xj ^ wj in at most k steps, 
Xj => w ; . Also, Xt, Xi, X,„ € (I u V'v)* implies that A -> X t X 2 . .. X,„ is 

' G‘ 

in P'. Thus, A => X t X 2 . . . X,„ =e> w { w 2 ■.. w,„. Hence by induction, (6.5) 

G' G' * 

is true for all derivations. In particular, S ^ w implies S => w. This proves 

G G' 

that L(G) c L(G'), and (ii) is completely proved. I 

Theorem 6.4 For every CFG G = (V^, 2, P, 5), we can construct an 
equivalent grammar G' = (V N , T, P', 5) such that every symbol in V' A , u T 
appears in some sentential form (i.e. for every X in V' N u I' there exists a 

such that S => a and X is a svmbol in the string a). 

G' 

Proof We construct G' = (V" v , I', P\ S ) as follows: 

(a) Construction of Wj for i > 1: 

(i) W, = {5}. 

(ii) W M = W t u {X G V N u 1 1 there exists a production A —> a with 

A e Wj and a containing the symbol X). 

We may note that W l c V N u I and Wj c W l+] . As we have only a finite 
number of elements in V, v u 2, W k = W k+ \ for some k. This means that 
W k = W k+j for all j >0. 

(b) Construction of V' N , I' and P': 

We define 

V' N = V N n W h 2' = 2 u W k 
P'= {A -> a\ A € WJ. 

Before proving that G' is the required grammar, we apply the construction to 
an example. 


EXAMPLE 6.6 


Consider G - ({5, A, B , E}, {a , ft, c}, P, 5), where P consists of S AP, 
A —^ B — ^ ft, P — ¥ c. 

Solution 

= {5} 

W 2 = {S} u {X € u 11 there exists a production A — > a with 

A e Wi and a containing X} 


= {5} u {A, B} 
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W 3 = {S, A. B} u { a, b} 

W 4 = W 3 

7* = {5, A, B} r = {a, 

F = {5 -> AS, A -> a. 5 -» 

Thus the required grammar is G' - (V#, X', P', 5). 

To complete the proof, we have to show that (i) every symbol in V' N u 
S' appears in some sentential form of G', and (ii) conversely, L(G') = L(G). 

To prove (i), consider X e V' N u S' = W*, By construction W* = W\ u 
W 2 ... u W*. We prove that X e W if i < L appears in some sentential form 

by induction on i. When i = 1. X = S and S => S. Thus, there is basis for 

induction. Assume the result for all variables in W- r Let X e W/ +1 . Then either 
X e W 7 -, in which case, X appears in some sentential form by induction 
hypothesis. Otherwise, there exists a production A —> (X, where A e W) and 
a contains the symbol X r The A appears in some sentential form, say /3Ay 
Therefore. 


5 => /3Ay => /3ay 

G' G' 

This means that /3ayis some sentential form and X is a symbol in pay Thus 
by induction the result is true for X e W h i < k. 

Conversely, if X appears in some sentential form, say ftXy then X => fiXy 
This implies X e W/. If / < k. then W 7 / c If / > k, then W\ = W^ G Hence 
X appears in V' N u X'. This proves (i). 

To prove (ii), we note L(G') c L(G ) as c V N , X' c X and P'c P. 
Let H’ be in L(G) and 5 = cq => csr 3 = . . . => => w. We prove 

G " G G 

that every symbol in a M is in W i+ j and a z a i+1 by induction on i. 

(j f 

OT] = 5 => a 2 implies 5 —» a 2 is a production in P. By construction, every 

G 

symbol in a 2 is in W 2 and 5 —> a 2 is in P\ i.e. S => a 2 . Thus, there is basis 

G 

for induction. Let us assume the result for 1 Consider a /+1 => a /+2 . This one- 
step derivation can be written in the form ° 

PmA-Ym => A+i«^+i 

where A —» a is the production we are applying. By induction hypothesis, 
A e Wj+[. Bv construction of W/ +2 , every symbol in a is in W i+2 - As all the 
symbols in f5 i+l and y +1 are also in W {+ j by induction hypothesis, every symbol 
in Pi+\Ocy + \ - GCj +2 is in W /+2 . By the construction of P\ A —» a is in P\ 

This means that a z+I => a z+2 . Thus the induction procedure is complete. 
G' 

So 5 = cq => gs => a 3 => . .. => w\ Therefore, w € L(G'). This 

g' “ g' ~ G' G' 

proves (ii). I 
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Definition 6.8 Let G = (V N , 2, P, S ) be a CFG. G is said to be reduced or 
non-redundant if every symbol in V N u I appears in the course of the 
derivation of some terminal string, i.e. for every X in V N u 2, there exists 
a derivation S oX/3 w e L(G). (We can say X is useful in the 
derivation of terminal strings.) 

Theorem 6.5 For every CFG G there exists a reduced grammar G ' which is 
equivalent to G. 

Proof We construct the reduced grammar in two steps. 

Step 1 We construct a grammar Gj equivalent to the given grammar G so 
that every variable in G x derives some terminal string (Theorem 6.3). 

Step 2 We construct a grammar G' ~ (V 7 ( v , S', P\ S ) equivalent to G\ so that 
every symbol in G' appears in some sentential form of G' which is equivalent 
to G\ and hence to G. G' is the required reduced grammar. 

By step 2 every 7 symbol X in G' appears in some sentential form, say 
aXl1 By step 1 every symbol in aX/3 derives some terminal string. Therefore, 
5 => aX[3 =?> w for some w in 2*, i.e. G' is reduced. 

Note: To get a reduced grammar, we must first apply Theorem 6.3 and then 
Theorem 6.4. For, if we apply Theorem 6.4 first and then Theorem 6.3, we may 
not get a reduced grammar (refer to Exercise 6.8 at the end of the chapter). 


EXAMPLE 6.7 


Find a reduced grammar equivalent to the grammar G whose productions are 
S —^ AB jCA. B —^ BC\AB. A —^ C —^ ciB j h 

Solution 


W x = 

{A, C} 

as A — > a 

and 

C -> b 

are 

productions with a terminal 


string on R.H.S. 





W 2 = 

{A. C} 

u {A, | A 

i “> 

a with 

a 

e (S u {A, C})*} 

= 

{A. C} 

u {5} as 

we 

have S 


CA 

w 3 = 

{A, C, 

S} u {A[ 

1 

—> a with 

a e (I u {S, A, C})*} 

= 

{A. C, 

S} u 0 





w 2 . 








V’x-- 

= W ; = {5 

♦ A, 

C} 




P' - 

- {A| — > oc [ A 

!, a e 

(V' f 



— {S —> CA. A —^ u. C —^ b } 

Thus. 


G, = ({S, A. C}, {a. b}, {5 -» CA. A -» a, C -> b), 5) 
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Step 2 We have to apply Theorem 6.4 to Gj. Thus, 

Wi = {S} 

As we have production S —> CA and 5 e W 2 = {5} u {A, C} 

As A —> a and C —» b are productions with A, C € W 2 , W 3 = {5, A, C, a, fc} 

As W 3 = V' N u I, P" = {S -» a\A x e W 3 } = P' 

Therefore, 

G' = ({5, A, C}, {a, 6}, {5 -> CA, A -> a, C fe}, 5) 
is the reduced grammar. 


EXAMPLE 6.8 


Construct a reduced grammar equivalent to the grammar 

S —» aAa, A —> Sb | bCC \ DaA , C —> abb | £>D, 

£ -» flC, D -> a£>A 

Solution 

Step 1 VV'] = {C} as C — » abb is the only production with a terminal string 
on the R.H.S. 


W 2 = {C} u {£. A} 

as £ —> aC and A —> bCC are productions with R.H.S. in (Z u {C})* 
W 3 = {C, E. A} u {S} 
as S —» aAa and oAa is in (Z u Wh)* 

W 4 = W 3 u 0 

Hence. 

R'v = W 3 = {S, A, C. £} 

£' = {Aj —> a|a e (R v u Z)*} 

= {5 —» aAa, A —> Sb \ bCC, C —> abb, E —> aC} 
G x = (V' s , {a, b], P', S) 

Step 2 We have to apply Theorem 6.4 to G\ . We start with 


As we have S —> aAa, 
As A -> Sb I hCC. 


W, = {5} 

W z = {5} u {A. a) 


W, = {5, A, a} u { S . 6, C} = {5, A, C. a, h} 


As we have C —» ah£ 


W 4 = W 3 u {a. b } = W 3 
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Hence. 

P = {A\ — > cc | A| e W 3 } 

= {S —> aAa, A —> SZ? } bCC , (7 —> 

Therefore. 

G' = ({5. A, C}, {a, 6 }, P", 5) 


is the reduced grammar. 


6.3.2 Elimination of Null Productions 

A context-free grammar may have productions of the form A —> A. The 
production A —» A is just used to erase A. So a production of the form A —> 
A, where A is a variable, is called a null production. In this section we give a 
construction to eliminate null productions. 

As an example, consider G whose productions are S —> aS | aA | A, 
A —» A. We have two null productions S —» A and A —» A. We can delete 
A —> A provided we erase A whenever it occurs in the course of a derivation 
of a terminal string. So we can replace S —» aA by S a. If G\ denotes 
the grammar whose productions are S aS \ a | A, then L(Gi) = L(G) = 
{a n | n > 0}. Thus it is possible to eliminate the null production A A. If 
we eliminate S —> A, we cannot generate A in L(G). But we can generate 
L(G) - {A} even if we eliminate S —» A. 

Before giving the construction we give a definition. 

Definition 6.9 A variable A in a context-free grammar is nullable if A A. 

Theorem 6.6 If G = (Y v , X, P, S) is a context-free grammar, then we can 
find a context-free grammar Gi having no null prodctions such that L(G { ) = 
L(G) - {A}. 

Proof We construct Gj = (V#, X. P', S ) as follows: 

Step 1 Construction of the set of nullable variables: 

We find the nullable variables recursively: 

(i) w x = {A e V'v | A -> A is in P} 

(ii) W /+1 = Wf u {Ae V N \there exists a production A —» a with a e Wf*}. 

By definition of W r c W M for all 1 As V N is finite, W k+ \ = W* for some 
£ < | V N |. So, Wk+j = W k for all j. Let W = W*. W is the set of all nullable 
variables. 

Step 2 (i) Construction of P': 

Any production whose R.H.S. does not have any nullable variable is included 
in P\ 

(ii) If A —» XiX 2 . . . X k is in P, the productions of the form A -4 a { a 2 
... a* are included in P\ where a- } = X f if X,- £ W. = X, or A if X/ e 
W and «ia 2 ... a* * A. Actually, (ii) gives several productions in P' The 
productions are obtained either by not erasing any nullable variable on the 
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R.H.S. of A —^ X x X 2 . . . X k or by erasing some or all nullable variables 
provided some symbol appears on the R.H.S. after erasing. 

Let G[ = (L v , X, P', 5). Gj has no null productions. 

Before proving that Gj is the required grammar, we apply the construction 
to an example. 


EXAMPLE 6.9 


Consider the grammar G whose productions are S —> aS \ AB, A —> A. 
B —» A, D -» b. Construct a grammar G ] without null productions generating 
L(G) - {A}. 


Solution 

Step 1 Construction of the set W of all nullable variables: 

W, = {A, € V N \ A i —¥ A is a production in G} 


Thus. 


= (A, B] 

W 2 = {A, £} u {S} as S *“» AB is a production with AB e Wf. 

= {S, A, B} 

W ? = W 2 u 0= W 2 

W = ^2 = (5. A, P} 


Step 2 Construction of P': 

(i) Z> —> b is included in P\ 

(ii) S aS gives rise to S aS and 5 —> a. 

(iii) S —> AB gives rise to S —> AB . S A and 5 P. 

(Ate; We cannot erase both the nullable variables A and B in S —> AP as we 
will get 5 —» A in that case.) 

Hence the required grammar without null productions is 

Gj = ({5, A, B , Z>} 5 P, 5) 

where P' consists of 

D —> b. S —> oS, 5 —> AP. i5 —> S —> A. P —> P 

Step 3 L(G0 = L(G) - {A}. To prove that L(G) = L(G) - {A}, we prove 
an auxiliary' result given by the following relation: 

For all A e V N and w e X*, 

A w if and only if A => w and w —> A (6.6) 

G, G 

* * 

We prove the ‘if part first. Let A => w and w * A. We prove that A => vt ; 

G G, 

by induction on the number of steps in the derivation A => w. If A => w and 

G G 
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w ^ A, A —» w is a production in P\ and so A w. Thus there is basis for 

o x . 

induction. Assume the result for derivations in at most i steps. Let A => w and 

G 

w ^ a. We can split the derivation as A ==> X x X 2 . . . X k => W\W~> . . . w k , 

g G - 

where w = w x w 2 .. . w* and Ay => wy As w ^ A, not all vty s are A. If wj ^ A, 

G 

then by induction hypothesis, Xj => uy If wj ~ A, then Xj e W. So using the 

G, 

production A —> ■ ■ ■ A k in P, we construct A —> c^ori ■ • • a k in P\ 

where a ; - = Xj if wj A A and a) = A if wj = A (i.e. Xj e W). Therefore, 

He * 

A => Mob=> wi(X 2 ... a k => . . ■ => wpv 2 . . . w* = >v 

G i G i 

By the principle of induction, the ‘if’ part of (6.6) is proved. 

We prove the ‘only if’ part by induction on the number of steps in the 

derivation of A => w. If A => w, then A w is in P { . By construction of P\ 
G, G, 

A —> w is obtained from some production A X { X 2 . .. X n in P by erasing 
some (or none of the) nullable variables. Hence A => X x X 2 ... X n => w. So 

G G 

there is basis for induction. Assume the result for derivation in at most j steps. 

y+l J 

Let A => w. This can be split as A => X x X^ .. . X k w x wi ... w h where 
G G, " G, 

X} => ny The first production A —» - X k in P' is obtained from some 

G 

production A ^ a in P by erasing some (or none of the) nullable variables 

in cr. So A => or => Z,Z, . . . Z*. If Z, s I then Z f Z f = w.. If Z, € 
g g " G 

then by induction hypothesis, Z, w,-. So, we get A => Z t Z 2 . . . X k => 

G G G 

WjVt ’2 ... wy. Hence by the principle of induction whenever A => w, we have 

G x 

A => w and w ^ A. Thus (6.6) is completely proved. 

G 

By applying (6.6) to 5. we have w e L{G{) if and only if w e L(G) and 
w * A. This implies UG X ) = L(G ) - {A}. I 

Corollary 1 There exists an algorithm to decide whether A e L(G) for a 
given context-free grammar G . 

Proof A e L(G) if and only if S e W. i.e. S is nullable. The construction 
given in Theorem 6.6 is recursive and terminates in a finite number of steps 
(actually in at most | V N \ steps). So the required algorithm is as follows: 
(i) construct W ; (ii) test whether S € W, 
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Corollary 2 If G - (V N , Z. P. S) is a context-free grammar we can find an 
equivalent context-free grammar G x = (V'y, Z, P, S x ) without null productions 
except S[ —> A when A is in L(G). If S x —» A is in P 1? S x does not appear 
on the R.H.S. of any production in P x . 

Proof By Corollary 1, we can decide whether A is in L(G). 

Case 1 If A is not in L(G), G { obtained by using Theorem 6.6 is the required 
equivalent grammar. 

Case 2 If A is in L(G), construct G - (V v , Z, P\ S) using Theorem 6.6. 
L(G') = L(G) - {A}. Define G x = (V N u {SJ, Z, P b S{) 9 where P, = P'u 
» 5, Sj —» A}. does not appear on the R.H.S. of any production in 
P h and so G x is the required grammar with L(G X ) = L(G). I 

6.3.3 Elimination of Unit Productions 

A context-free grammar may have productions of the form A P, A, P 
e V N . 

Consider, for example. G as the grammar S —> A, A -» P, B —> C, 
C —» a. It is easy to see that L(G) = {a}. The productions 5 —> A, A —» P, 
B —> C are useful just to replace 5 by C. To get a terminal string, we need 
C —» If G x is S —» a, then L(Gi) = L(G). 

The next construction eliminates productions of the form A —> B. 

Definition 6.10 A unit production (or a chain rule) in a context-free 
grammar G is a production of the form A —» P, where A and B are variables 
in G. 

Theorem 6.7 If G is a context-free grammar,we can find a context-free 
grammar G x which has no null productions or unit productions such that 
L(G X ) = L(G). 

Proof We can apply Corollary 2 of Theorem 6.6 to grammar G to get a 
grammar G' = (V N , Z, P. S) without null productions such that L(G') = L(G). 
Let A be any variable in Ly. 

Step 1 Construction of the set of variables derivable from A: 

Define W,(A) recursively as follows: 

W 0 (A) = {A} 

W m (A) = Wj(A ) u (B e V N \ C -» B is in P with C € Wj(A)} 

By definition of W,(A), Wj(A) c W M (A). As V N is finite, W k+] (A ) = W k {A) 
for some k < |V v |. So. W k+j (A) = W,(A) for all j > 0. Let W(A) = W k (A). Then 
W(A. is the set of all variables derivable from A. 

Step 2 Construction of A-productions in G x : 

The A-productions in G x are either (i) the nonunit production in G 7 or 
(ii) A a whenever B —> a is in G with B e W(A) and a £ V^ v . 
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(Actually, (ii) covers (i) as A e W(A)). Now, we define G\ = (V N , Z, P,. S), 
where Pi is constructed using step 2 for every A e V N . 

Before proving that G\ is the required grammar, we apply the construction 
to an example. 



Let G be S —» AB, A —» a, B —> C\b, C —> D. D —» E and E a. Eliminate 
unit productions and get an equivalent grammar. 

Solution 

Step 1 V7 0 (S) = {5}, W X (S) = W 0 (S) u 0 
Hence W(5) = {5}. Similarly, 

W(A) = {A}, W(E) = {£} 

W 0 (B) = {£}, W\(B) = {B} u {C} = {B, C} 

W 2 (B)= {B. C} u {£>}. W 3 (B) = {B, C, D} u {£}, W 4 (B) = W 3 (B) 
Therefore, 

W{B) = {B, C, D, £} 

Similarly, 

W 0 (O = {C}, w t (0 = {C, D}. W 2 {C} = {C, D, £} = W 3 (C) 

Therefore, 

W(Q = {C, D, £}, W 0 (£>) = {D} 

Hence, 

W,(D) = {£.£} = W,(£>) 

Thus. 

W(D) = {£>,£} 

Step 2 The productions in G\ are 

S —^ AB, A —^ E —^ # 

B —^ b j ci* C —^ g* D —^ (i 

By construction. G x has no unit productions. 

To complete the proof we have to show that L(G') = L(G\). 

Step 3 L(G ') = L(G). If A -4 a is in P { - P, then it is induced by B a 
in P with B e W(A), a £ V N . B e W(A) implies A => B. Hence, A B 

G’ G' 

a . So, if A =^> a, then A ■=> a. This proves L(Gi) c L(G')- 

G’ q 1 

To prove the reverse inclusion, we start with a leftmost derivation 
5 => 0 C\ => (x -1 ... = w 

G G G 


in G\ 
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Let i be the smallest index such that Of: => a j+] is obtained by a unit 

G 

production and j be the smallest index greater than i such that a-, (Xj + i is 

* G ' 

obtained bv a nonunit production. So. S =^> a r and Of; => a j+x can be 

G, ‘ G’ 

written as 

Of/ = WiAifa => w/A /+1 /3/ =>...=> WiAjfii ^ = (Xj+i 

Aj e W(A/) and Ay —> / is a nonunit production. Therefore, Aj —> y is a 

production in Pj. Hence, a ; - => a- ]+x . Thus, we have S => GCj + \. 

g ! t?! 

Repeating the argument whenever some unit production occurs in the 
remaining part of the derivation, we can prove that S => a }1 = w. This proves 

G, 

L(G') c L(G). I 

Corollary If G is a context-free grammar, we can construct an equivalent 
grammar G' which is reduced and has no null productions or unit productions. 

Proof We construct G x in the following way: 

Step 1 Eliminate null productions to get G x (Theorem 6.6 or Corollary 2 of 
this theorem). 

Step 2 Eliminate unit productions in G x to get G 2 (Theorem 6.7). 

Step 3 Construct a reduced grammar G f equivalent to G x (Theorem 6.5). G' 
is the required grammar equivalent to G. 

Note: We have to apply the constructions only in the order given in the 
corollary of Theorem 6.7 to simplify grammars. If we change the order we 
may not get the grammar in the most simplified form (refer to Exercise 6.11). 


6.4 NORMAL FORMS FOR CONTEXT-FREE GRAMMARS 

In a context-free grammar, the R.H.S. of a production can be any string of 
variables and terminals. When the productions in G satisfy certain restrictions, 
then G is said to be in a 'normal form". Among several 'normal forms' we study 
two of them in this section—the Chomsky normal form. (CNF) and the 
Greibach normal form. 


6.4.1 Chomsky Normal Form 

In the Chomsky normal form (CNF), we have restrictions on the length of 
R.H.S. and the nature of symbols in the RH.S. of productions. 

Definition 6.11 A context-free grammar G is in Chomsky normal form if 
even' production is of the form A —» a, or A —» PC, and S A is in G if 
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A e L(G). When A is in L(G), we assume that S does not appear on the 
R.H.S. of any production. 

For example, consider G whose productions are S —> AB \ A, A —» ft, 
B b. Then G is in Chomsky normal form. 

Remark For a grammar in CNF, the derivation tree has the following 
property: Every node has atmost two descendants—either two internal vertices 
or a single leaf. 

When a grammar is in CNF, some of the proofs and constructions are 
simpler. 

Reduction to Chomsky Normal Form 

Now we develop a method of constructing a grammar in CNF equivalent to a 
given context-free grammar. Let us first consider an example. Let G be S 
ABC | aC, A —¥ a. B —» b, C —> c. Except S —> aC \ ABC , all the other 
productions are in the form required for CNF. The terminal a in S -4 aC can 
be replaced by a new variable D. By adding a new production D —> ft, the effect 
of applying S —» aC can be achieved by S DC and D -» a. S —> ABC is not 
in the required form, and hence this production can be replaced by S AE and 
E -» BC Thus, an equivalent grammar is S —» AE | DC, E —> BC , A -» ft. 
B —^ b , C —> c, D —> 

The techniques applied in this example are used in the following theorem. 

Theorem 6.8 (Reduction to Chomsky normal form). For every context-free 
grammar, there is an equivalent grammar G 2 in Chomsky normal form. 

Proof (Construction of a grammar in CNF) 

Step 1 Elimination of null productions and unit productions: 

We apply Theorem 6.6 to eliminate null productions. We then apply 
Theorem 6.7 to the resulting grammar to eliminate chain productions. Let the 
grammar thus obtained be G = (V N , X, P, S). 

Step 2 Elimination of terminals on R.H.S.: 

We define G { = (L'y, X, P\. S'), where and V' N are constructed as follows: 

(i) All the productions in P of the form A —> a or A —» BC are included 
in P h All the variables in V N are included in V' N . 

(ii) Consider A —» X\X 2 . . . X n with some terminal on R.H.S. If is a 
terminal, say a b add a new variable C fl . to V% and C a . —> a { to P\. 
In production A —» X\X 2 . . . X„, every terminal on R.H.S. is replaced 
by the corresponding new variable and the variables on the R.H.S. are 
retained. The resulting production is added to P\. Thus, we get 
Gi = (VW, X, P h S). 

Step 3 Restricting the number of variables on R.H.S.: 

For any production in P\, the R.H.S. consists of either a single terminal (or 
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A in S A) or two or more variables. We define G 2 = (V/J, Z, P 2 , 5) as 
follows: 

(i) All productions in P 3 are added to P 2 if they are in the required form. 
All the variables in V' N are added to 

(ii) Consider A A]A 2 . . . A m , where m > 3. We introduce new 

productions A —» A { C 1? C 3 —» A 2 C 2 , ■ • •» Cw-2 A #ll _iA m and 
new variables C 1? C 2 , . . C m _ 2 . These are added to P" and V^u 

respectively. 

Thus, we get G 2 in Chomsky normal form. 

Before proving that G 2 is the required equivalent grammar, we apply the 
construction to the context-free grammar given m Example 6.11. 


EXAMPLE 6.11 


Reduce the following grammar G to CNF. G is S aAD . A —> aB \ bAB , 
B -> b,D cl 

Solution 

As there are no null productions or unit productions, we can proceed to step 2. 

Step 2 Let G { = (V' v . {a. b. cl }, P u S ). where P 3 and are constructed 
as follows: 

(i) B —» £, D d are included in P 3 . 

(ii) 5 —> aAD gives rise to 5 QAD and —> a. 

A aB gives rise to A C U B. 

A —» BAB gives rise to A C /; AB and Q, -> b. 

V's = {5. A. B. D. C„ Q}. 

Step 3 Pi consists of 5 —> C a AD , A —» QB j C h AB, B —> b. D —> <:/, C a —* a, 
C* —» A 

A —^ C a B. B —^ B, £1 —^ cl G cl —^ ^ b are added to P 2 

5 —» C^AD is replaced by S —> C a C\ and C\ —» AD. 

A —» C h AB is replaced by A —» C),C 2 and C 2 —> AP. 

Let 

G 2 = ({5. A. B. D, C a , C h . C h C 2 }. {m b, d), P 2 , 5) 

where P 2 consists of 5 —> C w Cj. A —» QP ) QC 2 , Cj AD, C 2 —> AP, 
B b, D d. C a —> < 3 , Q —* b. G 2 is in CNF and equivalent to G. 

Step 4 L(G) = L(G 2 ). To complete the proof we have to show that L(G) = 
L(G]) = L(G 2 ). 

To show that L(G) c L(G 3 ), we start with w e L(G). If A —» XiX 2 ... X„ 
is used in the derivation of w\ the same effect can be achieved by using the 
corresponding production in P 3 and the productions involving the new 

variables. Hence. A => X\X- . . . X tr Thus, L(G) c L(G 3 ). 

G, 
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Let w e L(Gj). To show that w e L(G ), it is enough to prove the 
following: 

A => w if A e V N . A 4 w (6.7) 

G ' Gi 

* 

We prove (6.7) by induction on the number of steps in A => w. 

C 

If A => vv, then A w is a production in P { . By construction of P h w is 
G ] 

a single terminal. So A —> w is in P, i.e. A => vv. Thus there is basis for 
induction. 

k + \ 

Let us assume (6.7) for derivations in at most k steps. Let A w. We can 

G i 

k * 

split this derivation as A => AiAo. . . A m => w x • = w such that A,- 

G^G G, 

* 

Each Aj is either in V N or a new variable, say C a , When Aj e V N , A, => w, 

‘ G \ 

* 

is a derivation in at most k steps, and so by induction hypothesis, A,- => w { . 
When A ; = C a .. the production Q. —» a,- is applied to get A f =b> w r The 
production A -» AjA 2 ... A m is induced by a production A —> X\X 2 ... X m 
in P where X, = A 2 if A 7 e V. v and X; = w. if A, = Q.. So A => XjX-* . . . X m 

1 G 

z=> vvpv^ . . . w m , i.e. A vv. Thus. (6.7) is true for all derivations. 

G G 

Therefore. L(G) = L(G X ). 

The effect of applying A —» A { A 2 . . . A,„ in a derivation for vv e L(Gi) 
can be achieved by applying the productions A —» AjCj, Q —» A 2 C 2 , . .., 
G„ 2_2 A^^A,,, in P 2 . Hence it is easy to see that L{G\) c L(G 2 ). 

To prove ZXG 2 ) c L(G]), we can prove an auxiliary result 

A => vv if A 6 V' N , A ==> vv (6.8) 

G, G : 

Condition (6.8) can be proved by induction on the number of steps in A w. 

g 2 

Applying (6.7) to S. we get L(G 2 ) c L(G). Thus, 

L(G) = L(G,) = L(G 2 ) I 


EXAMPLE 6.12 


Find a grammar in Chomsky normal form equivalent to S aAbB , A —» 

#A I g. P bB I b. 
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Solution 

As there are no unit productions or null productions, we need not carry out 
step 1. We proceed to step 2. 

Step 2 Let G { - (V' N { a . b}, P b 5), where P } and V' N are constructed as 
follows: 

(i) A —> a, B b are added to P x . 

(ii) S —> aAbB , A —> ciA. B —> bB yield S —> C a AC b B * A —> C^A-* 

P —^ C/,P, C a —) 77 , C b —> b> 

W = {S, A, P, c d , C*}. 

Step 3 Pj consists, of 5 -> C a AC h B y A — > C a A , P —> C ; ,P, C fl —> a. 
C/ 7 —^ b , A —> r?, P —^ b. 

S —» C a AC b B is replaced by 5 —» QC b Cj —» AC 2 . C 2 —» QP 
The remaining productions in P] are added to P 2 . Let 

G 2 = ({5, A, P, Q. Q, C 1? C 2 }. {a. P 2 . 5), 

where P 2 consists of 5 —» C a Cj, C\ —> AC 2 , C 2 —> C b B, A —> C a A, P —> QP, 
^ 77, C b —>■ b. A —> 77. and P —^ b. 

G 2 is in CNF and equivalent to the given grammar. 


EXAMPLE 6.13 


Find a grammar in CNF equivalent to the grammar 

S -> -5 | [5 d) 5] | p | g (5 being the only variable) 

Solution 

As the given grammar has no unit or null productions, we omit step 1 and 
proceed to step 2. 

Step 2 Let G { - (V' N , E. P b 5). where Pj and V' N are constructed as follows: 

(i) S —> p|t/ are added to P L . 

(ii) S —» ~ S induces S —> AS and A 

(iii) 5 [5 => S] induces S -> BSCSD , P -» [,C —» z>, £> ->] 

Vy = {5. A, P, C. D} 

Step 3 Pi consists of S —» p | t/. 5 -» AS. A —> P —» [,C —» 3, Z) —>], 

5 -> BSCSD . 

5 -> BSCSD is replaced by 5 PC h Q 5C 2 . C 2 CC 3 , C 3 SD. 
Let 

G 2 = ({5, A. P. C, A Cj. C 2 , C 3 }. I, P 2 , S) 

where P 2 consists of 5 —» /? | |AS | P<r 1? A —> P —> [. C —> 3, D —»], 
C[ —» PC 2 , C 2 —> CC 3 , C 3 —» SZ). G 2 is in CNF and equivalent to the given 


grammar. 
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6.4.2 Greibach Normal Form 

Greibach normal form (GNF) is another normal form quite useful in some 
proofs and constructions. A context-free grammar generating the set accepted 
by a pushdown automaton is in Greibach normal form as will be seen in 
Theorem 7.4. 

Definition 6.12 A context-free grammar is in Greibach normal form if every 
production is of the form A —» aa . where a e V 7 * and a e £(a may be A), 
and 5 —> A is in G if A 6 L(G). When A e L(G), we assume that S 
does not appear on the R.H.S. of any production. For example, G given by 
S —^ aAB [ A, A — > bC.\ B —^ A C —^ c is in GNF. 

Note: A grammar in GNF is a natural generalisation of a regular grammar. 
In a regular grammar the productions are of the form A —» aoc, where ag! 
and a e V N u {A}, i.e. A aa ; with aV% and | a\ < 1. So for a grammar 
in GNF or a regular grammar, we get a (single) terminal and a string of 
variables (possibly A) on application of a production (with the exception of 
S —> A). 

The construction we give in this section depends mainly on the following 
two technical lemmas: 

Lemma 6.1 Let G = (V N , I, P, S) be a CFG. Let A —» By be an A-production 
in P. Let the 5-productions be B —» /3] | /? 2 | • • • i A- Define 

P { = (P - {A -> By}) u {A A7 |1 < / < *}. 

Then. Gj = (L v , Z, P\> S) is a context-free grammar equivalent to G. 

Proof If we apply A —» By in some derivation for w e L(G), we have to 

apply B —> A f° r some / at a later step. So A => Bj. The effect of applying 

A By and eliminating B in grammar G is the same as applying A -> A7 
for some i in grammar G^ Hence w e L(G |), i.e. L(G) c L(G f ). Similarly, 
instead of applying A Ay* we can a PP'y A —» By and 5 —> A t0 § et 
A => A)< This proves L(G l ) c L(G). I 

G 

Lemma 6.1 is useful for deleting a variable 5 appearing as the first 
symbol on the R.H.S. of some A-production, provided no 5-production has 5 
as the first symbol on R.H.S. 

The construction given in Lemma 6.1 is simple. To eliminate 5 in A —» 5y 
we simply replace 5 by the right-hand side of every 5-production. 

For example, using Lemma 6.1, we can replace A Bab by A aAab , 
A —» bBab. A —» <:/ggZ?. A A5aZ? when the 5-productions are 5 —^ aA | bB | 
r/r/1A 5. 

The lemma is useful to eliminate A from the R.H.S. of A —> Aa. 

Lemma 6.2 Let G = (V v , Z, 5, 5) be a context-free grammar. Let the set 
of A-productions be A —» Ao^i | . . . Aa r | A| ■ • • IA (A ?s do not start with A). 
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Let Z be a new variable. Let G x = (V N u {Z}, X, P 3 , S), where P { is defined 
as follows: 

(i) The set of A-productions in P x are A —» A IA I • • ■ ! A 

A AZ| ftzl ... | p s z 

(ii) The set of Z-productions in are Z a x \a 2 \ ... \<X r 

Z —> OfjZ | o.CZ, • • ♦ j a 7 Z 

(iii) The productions for the other variables are as in P. Then G 3 is a CFG 
and equivalent to G. 

Proof To prove L(G) e L(G), consider a leftmost derivation of w in G. The 
only productions in P - P x are A —> Aaj | Aa 2 . . . | Aa r . If A Aa iv 
A —> AcCj t A Aa /A are used, then A A should be used at a later 

stage (to eliminate A). So we have A fijAa^ . . . a,,. while deriving w in 
G. However, 


^ 7 ? 7 ? Pi a :]Z * * • 7 ? Pj<X i{ <Xi 2 • • • 

G; G, G, 

i.e. 

A => fijA0Cj l . . . Of/jj. 

Thus. A can be eliminated by using productions in G 3 . Therefore, w e L(G 3 ). 

To prove L(G 3 ) c L(G), consider a leftmost derivation of w in G^ The 
only productions in Pi - P are A —» AZ| j 8 2 Z| . . . |A-Z Z —> ad ... |a n 
Z -» a x Z\a 2 Z | . . . |a,Z If the new variable Z appears in the course of the 
derivation of w. it is because of the application of A —> fijZ in some earlier 
step. Also, Z can be eliminated only by a production of the form Z —» <Xj or 

Z —> (XjZ for some i and j in a later step. So we get A => PjOj OCi .. . (Xj 

G\ 

in the course of the derivation of w. But, we know that A => A°h a n • • • a ir 

G ' 1 * 

Therefore, w e L(G). I 



Apply Lemma 6.2 to the following A-productions in a context-free grammar 
G. 

A —> aBD | bDB | c, A —» AB \ AD 

Solution 

In this example, a x - B. a 2 - D, A = aBD . A = As - S° the new 
productions are: 

(i) A aBD\bDB\c . A aBDZ\bDBZ\cZ 

(ii) Z A Z -> D. Z ^ SZ|DZ 
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Theorem 6*9 (Reduction to Greibach normal form). Every context-free 
language L can be generated by a context-free grammar G in Greibach normal 
form. 

Proof We prove the theorem when A £ L and then extend the construction 
to L having A. 

Case 1 Construction of G (when A € L): 

Step 1 We eliminate null productions and then construct a grammar G in 
Chomsky normal form generating L We rename the variables as 
A b A : , . . .,A n with S = A f . We write G as ({A,, A 2 , — A,,}, I, P, A x ). 

Step 2 To get the productions in the form A z - —» ay or A } —» A ; -y where 
j > i. convert the A r productions (i = 1, 2. . . ., n - 1) to the form A, A{y 
such that j > i. Prove that such modification is possible by induction on i. 

Consider A]-productions. If we have some A!-productions of the form 
A i —> A { y then we can apply Lemma 6.2 to get rid of such productions. We 
get a new variable, say Z b and A j -productions of the form Aj -» a or A { —> Apy\ 
where j > 1. Thus there is basis for induction. 

Assume that we have modified A { -productions, A 2 -productions . . . 
A/-productions. Consider A /+1 -productions. Productions of the form 
A/ + i —» ay required no modification. Consider the first symbol (this will be 
a variable) on the R.H.S. of the remaining A /+} -productions. Let t be the 
smallest index among the indices of such symbols (variables). If t > i + 1, 
there is nothing to prove. Otherwise, apply the induction hypothesis to 
Arproductions for t < i. So any A r production is of the form A t —» A ; y 
where j > t or A t ay'. Now we can apply Lemma 6.1 to A /+1 -production 
whose R.H.S. starts with A t . The resulting A /+1 -productions are of the form 
A f+1 -» A { y where j > t (or A i+{ -> ay'). 

We repeat the above construction by finding t for the new set of 
A/ + i-productions. Ultimately, the A /+1 -productions are converted to the form 
Aj+[ A;/; where j > i + 1 or A i+ \ ay'. Productions of the form 
A /+1 —> A j+1 y can be modified by using Lemma 6.2. Thus w*e have converted 
A /+ i-productions to the required form. By the principle of induction, the 
construction can be carried out for i = 1,2,..., n. Thus for f = 1.2,..., 
n — 1, any A r production is of form A f —^ A ? -y where j > i or A i —> ay'. Any 
A ;r production is of the form A n —> A n y or A n —> ay' . 

Step 3 Convert A n -productions to the form A n —> ay Here, the productions 
of the form A n —> A n y are eliminated using Lemma 6.2. The resulting 
A 72 -productions are of the form A n ay 

Step 4 Modify the A r productions to the form A, —» ay for i = 1. 2, .... 
n - 1. At the end of step 3, the A /r productions are of the form A n —» ay 
The A /z _!-productions are of the form A n ^ —» ay' or A„_] —> A n y By applying 
Lernma 6.1. we eliminate productions of the form A n _ { A n y The resulting 





Chapter 6; Context-Free Languages a 209 


A n _ x -productions are in the required form. We repeat the construction by 
considering A„_ 2 , A„_ ? , .... A,. 

Step 5 Modify Z r productions. Every time we apply Lemma 6.2, we get a 
new variable. (We take it as Z { when we apply the Lemma for A r productions.) 
The Zfproductions are of the form Z,- —> aZ, or Z ; —» a (where a is obtained 
from A; —> A,-a), and hence of the form Z ? —» <7/ or Z ; A*y for some L 
At the end of step 4. the R.H.S. of any A r production starts with a terminal. 
So we can apply Lemma 6.1 to eliminate Z i —> A k y. Thus at the end of 
step 5, we get an equivalent grammar G\ in GNF. 

It is easy to see that G\ is in GNF. We start with G in CNG. In G any 
A-production is of the form A —» a or A —» AB or A —» CD . When we apply 
Lemma 6.1 or Lemma 6.2 in step 2, we get new productions of the form 
A —» aa or A —» /3, where a e V* and /3 e and a s X. In steps 3-5, 
the productions are modified to the form A —> aa or Z —> a'a\ where a, rt 
e I and a. a/ e 

Case 2 Construction of G when A € L: 

By the previous construction we get G' = (V 7 v, X, Fj, S) in GNF such that 
L((j) - L - {A}. Define a new grammar G x as 

G { = {V' N u {S'}, X. ?i u {S 7 S, S' A}, S') 

S' —> S can be eliminated by using Theorem 6.7. As S-productions are in the 
required form, S'-productions are also in the required form. So L(G) - L(G\) 
and G\ is in GNF. I 

Remark Although we convert the given grammar to CNF in the first step, 
it is not necessary to convert all the productions to the form required for CNF. 
In steps 2-5. we do not disturb the productions of the form A —» aa , a e X 
and a e V% So such productions can be allowed in G (in step 1). If we apply 
Lemma 6.1 or 6.2 as in steps 2-5 to productions of the form A —> a. where 
a € V% and j a\ > 2, the resulting productions at the end of step 5 are in the 
required form (for GNF). Hence we can allow productions of the form A —> a, 
where a e V% and | a\ > 2. 

Thus we can apply steps 2-5 to a grammar whose productions are either 
A —> aa where a e V*. or A —> a e L* where | a\ > 2. To reduce the 
productions to the form A a € where \a\ > 2, we can apply step 2 
of Theorem 6.8. 


EXAMPLE 6.15 


Construct a grammar in Greibach normal form equivalent to the grammar 
S ^ AA \ a. A —> SS \ b. 

Solution 

The given grammar is in CNF. S and A are renamed as Aj and A 2 . 
respectively. So the productions are A } —> A { A 2 \a and A 2 —> A\A\ \ b. As the 
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given grammar has no null productions and is in CNF we need not carry out 
step 1. So we proceed to step 2. 

Step 2 (i) A]-productions are in the required form. They are A x —> A 2 A 2 \ a. 

(ii) A 2 b is in the required form. Apply Lemma 6.1 to A 2 —> A x A b 
The resulting productions are A 2 —> A 2 A 2 A l? A 2 —» ciA x . Thus the 
A 2 -productions are 

A 2 —^ AvLA b A 2 —^ Mj, A 2 —y b 

Step 3 We have to apply Lemma 6.2 to A 2 -productions as we have 
A 2 —> A 2 A 2 A!. Let Z 2 be the new variable. The resulting productions are 

A 2 —)■ flfAj, A 2 —^ Z? 

A 2 —^ rzA]Z2» A 2 —^ bZ 2 

Z 2 —> A 2 A], Z 2 —> A 2 A |Z 2 . 

Step 4 (i) The A 2 -proauctions are A 2 —> aAi\b\aA x Z 2 \bZ 2 . 

(ii) Among the A r -productions we retain Aj —> d? and eliminate 
A] —» A 2 A 2 using Lemma 6.1. The resulting productions are A\ —> aA { A 2 \bAy 
Ai —> aA { Z 2 A 2 \bZ 2 A 2 . The set of all (modified) A r productions is 

A\ —> ci j ciA \A 2 j bA 2 j ciA\Z 2 A 2 J bZ 2 A 2 

Step 5 The Z 2 -productions to be modified are Z 2 A 2 A b Z 2 —> A 2 A{Z 2 . 
We apply Lemma 6.1 and get 

Zt —^ aA ] A ] | Z?A{ j uA x Z^A\ j bZ 2 A x 

Z 2 —> dzA j A i Z 2 [ Z?A j Z 2 J dzA j Z 2 A \ Z 2 J bZ 2 A ] Z 2 

Hence the equivalent grammar is 

G' = ({A h A 2 , Z 2 } ? {a, b} 9 P h AO 

where P x consists of 

A] —¥ q j ciAjA-) j bA 2 1 uA x Z^A x j bZ 2 A 2 

A 2 —^ ciA\ | Zz | dzA \Z 2 1 bZ 2 

Z 2 —^ clA i A} | bA i J ci A j Z 2 A j [ bZ 2 A ] 

Z 2 —^ dzA | A ] Z 2 | ZzA j Z 2 j dzA ] Z 2 A j Z 2 j bZ 2 A ] Z 2 


EXAMPLE 6.16 


Convert the grammar S —> AB, A —» 1 Zz, £ SA | a into GNF. 

Solution 

As the given grammar is in CNF, we can omit step 1 and proceed to step 2 
after renaming 5. A. 5 as A b A 2 , A 3 . respectively. The productions are A x 
A 2 A 2 . A 2 — ^ A'Ai j Zz, A 3 —^ A]A-> j ci. 
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Step 2 (i) The A r production A] A 2 A 3 is in the required form. 

(ii) The A-.-productions A 2 —» AyA [ | b are in the required form. 

(iii) A 3 -> a is in the required form. 

Apply Lemma 6.1 to A 3 —> AjA 2 . The resulting productions are A 3 A 2 A 3 A 2 . 
Applying the lemma once again to A 3 —> AAyL- we get 

A 3 —> AyA[AyA 2 | Z?A 3 A 2 . 

Step 3 The A 3 -productions are A 3 —> a ! bA 3 A 2 and A 3 —» A 3 A 1 A 3 A 2 . As we 
have A 3 -4 AAiAA:- we have to apply Lemma 6.2 to A 3 -productions. Let 
Z 3 be the new variable. The resulting productions are 

A 3 —> a j bAyA 2 , A 3 —^ #Z 3 | bAyA 2 Z 3 

Z 3 —> A 1 AA 2 , Z 3 —^ AAA 2 Z 3 

Step 4 (i) The A 3 -productions are 

Ay —> o .} Z7AA2I j bAyAyZy ( 6 * 9 ) 

(ii) Among the A 2 -productions, we retain A 2 b and eliminate A 2 —» 
A 3 A 1 using Lemma 6.1. The resulting productions are 

An —¥ Cl A | | bAyAyA] |nZ 3 A j bAyA 2 ZyA\ 

The modified A 2 -productions are 

A^ ^ Z? | flA] J Z?A 3 A 2 A] j r/Zylj [ M ( 6 , 10 ) 

(iii) We apply Lemma 6.1 to A\ —» A 2 A 3 to get 

A t —> Z?A 3 j ^A]A 3 J £>A 3 A 2 A]A 3 ( nZ 3 A]A 3 j bAyA 2 ZjA\Ay (6.11) 
Step 5 The Z 3 -productions to be modified are 

Z 3 —^ A]AA 2 1 AjAA 2^ 3 
We apply Lemma 6.1 and get 

Z 3 —^ bAyAyAy | bAyAyZy 

Z 3 —^ aA)AyAyA 2 1 ciA\AyAyA 2 Zy 

Z 3 M3A2A1A3A3A2 1 MAAAAA2Z3 ( 6 , 12 ) 

Z 3 —^ aZAjAAA^ | ClZyA \ AyAyA 2 Zy 

Z 3 —> ^A 3 A 2 Z 3 A}A 3 A 3 A 2 j Z?A 3 A 2 Z 3 A jA 3 A 3 A 2 Z 3 

The required grammar in GNF is given by (6.9)—(6.12). 

The following example uses the Remark appearing after Theorem 6.9. In 
this example we retain productions of the form A —» aa and replace the 
terminals only when they appear as the second or subsequent symbol on 
R.H.S. (Example 6.17 gives productions to generate arithmetic expressions 
involving a and operations like +. * and parentheses.) 
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Find a grammar in GNF equivalent to the grammar 

E —> E + 7|7, 7 -> 7*7|7 F (7)|a 

Solution 

Step 1 We first eliminate unit productions. Hence 

W 0 (E) = {7}, W X (E) = {E} u {7} = {7, 7} 

W,(7) = {7, 7} u {7} = {7, 7, 7} 

So, 

W(E) = {7, 7. 7} 

So, 

Wq(7) = {7}, Wffl = {7} u {7} = {7, 7} 

Thus. 

W(7) = {7, 7} 

Wq(7) = {7}, WK7) = {7} - M7 

The equivalent grammar without unit productions is, therefore, G x - (V N , 
X. Pi, 5), where P { consists of 

(i) 7 -> 7 4- 71 7 * 71 (7) | a 

(ii) 7 —» 7 * 7 ] (7) | a. and 

(iii) 7 —» (7) | a. 

We apply step 2 of reduction to CNF. We introduce new variables A, 7 
C corresponding to +, *.). The modified productions are 

fi) 7 7A7| 7771 (7C| a 

(ii) 7 -> TBF\(EC\a 

(iii) 7 —» (7C | a 

(iv) A +. B -> *, C ) 

The variables A. 7. C, 7. 7 and 7 are renamed as A^ A 2? A 3 , A 4 . A 5 , A 6 . 
Then the productions become 

A] —> +, A 2 —^ *. A 3 —> ). A 4 —^ (A^A 3 1(6.13) 

A 5 -■> A 5 A 2 A 4 | (A(yA? | ^ 

A 5 —> A^A 1 A 5 j A 5 A 2 A 41 (A^A 3 1 ci 

Step 2 We have to modify only the Ay and A 6 -productions. A 5 —> A 5 A 2 A 4 
can be modified by using Lemma 6.2. The resulting productions are 

As —^ (A 6 A 3 j ci. A 5 —> (A 5 A 3 X 5 J ^ 2 X 5 (6.14) 

—> A 2 A 4 | A 2 A 4 Z 5 

A 6 —» A 5 A 2 A 4 can be modified by using Lemma 6.1. The resulting 
productions are 

A 5 —> (A 5 A 3 A 2 A 41 C 1 A 2 A 41 (A 5 A 3 X 5 A 2 A 4 j ciZ^A^A^ 

A 6 (AgA^lfl are in the proper form. 
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Step 3 A 6 —» A 6 A l A 5 can be modified by using Lemma 6.2. The resulting 
productions give all the A 6 -productions: 

Aft —> (A5A3A2A4 J ciAyA^ | (A6A3Z5A2A4 

A 6 —> ciZtyA-tA^. j J a (6.15) 

A () —^ (A (jA 3 A 2 A fZ^ J ciA^AfZ^ j (A (yA fZ^A 2 A 4 . 2 ^ 

A 6 —^ ciZ^A^A^Z^ | (AfiA^Zfi j dZfr (6.16) 

A 6 -> AiA 5 \A { A 5 Z 6 

Step 4 The step is not necessary as A-productions for i = 5, 4, 3, 2. 1 are 
in the required form. 

Step 5 The Z 5 -productions are Z 5 -» A 2 A 4 IA 0 A 4 Z 5 . These can be modified 
as 

Z 5 —> * A 4 |* A 4 Z 5 <6.17 ) 

The Z 6 -productions are Z 6 —> A 1 A 5 |A ] A 5 Z 6 . These can be modified as 

Z 6 -> + A 5 |+ A 5 Z 6 (6.18) 

The required grammar in GNF is given by (6.13)—(6.18). 

6.5 PUMPING LEMMA FOR CONTEXT-FREE 
LANGUAGES 

The pumping lemma for context-free languages gives a method of generating 
an infinite number of strings from a given sufficiently long string in a context- 
free language L. It is used to prove that certain languages are not context-free. 
The construction we make use of in proving pumping lemma yields some 
decesion algorithms regarding context-free languages. 

Lemma 6.3 Let G be a context-free grammar in CNF and T be a derivation 
tree in G. If the length of the longest path in T is less than or equal to k, then 
the yield of T is of length less than or equal to 2 k ~ [ . 

Proof We prove the result by induction on k , the length of the longest path 
for all A-trees (Recall an A-tree is a derivation tree whose root has label A). 

When the longest path in an A-tree is of length L the root has only one son 
whose label is a terminal (when the root has two sons, the labels are variables). 
So the yield is of length 1. Thus, there is basis for induction. 

Assume the result for k - 1 (k > 1). Let T be an A-tree with a longest path 
of length less than or equal to k. As k > L the root of T has exactly two sons 
with labels A { and A 2 . The two subtrees with the two sons as roots have the 
longest paths of length less than or equal to k - 1 (see Fig. 6,12), 

If Wj and h ' 2 are their yields, then by induction hypothesis, | wq | < 2 k ~ 2 , 

| W '2 1 < 2 k ~ 2 . So the yield of T = uqw : . fuqw 2 | < 2 k ~ 2 + 2 k ~ 2 = 2 k ~\ By the 
principle of induction, the result is true for all A-trees, and hence for all 
derivation trees. 
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Fig. 6.12 Tree T with subtrees T\ and T 2 * 

Theorem 6.10 (Pumping lemma for context-free languages). Let L be a 
context-free language. Then we can find a natural number n such that: 

(i) Every z e L with \z\ ^ n can be written as uvwxy for some strings 
u, v, w, x, y. 

(ii) \vx\ > 1. 

(iii) | vwx | < n. 

(iv) uv k w^y € L for all k > 0, 

Proof By Corollary 1 of Theorem 6.6, we can decide whether or not A e L. 
When A e L, we consider L - { A} and construct a grammar G = (V#, E, P, S ) 
in CNF generating L - {A} (when A g L, we construct G in CNF generating 
L). 

Let \V N \ = m and n = T\ To prove that n is the required number, we 

start 

with z e L, | z | ^ 2 m , and construct a derivation tree T (parse tree) of z- If 
the length of a longest path in T is at most m, by Lemma 6.3, |z| < 2 m ~ ] (since 
z is the yield of 7). But \z\ > 2 m > 2 U ~\ So T has a path, say T, of length 
greater than or equal to m + 1. T has at least m + 2 vertices and only the last 
vertex is a leaf. Thus in T all the labels except the last one are variables. As 
| V N | = w, some label is repeated. 

We choose a repeated label as follows: We start with the leaf of T and 
travel along F upwards. We stop when some label, say B , is repeated. (Among 
several repeated labels, B is the first.) Let v\ and v 2 be the vertices with label 
B , vj being nearer the root. In T. the portion of the path from to the leaf has 
only one label namely B , which is repeated, and so its length is at most m + 1. 

Let T\ and 71 be the subtrees with v 1? v 2 as roots and zu w as yields, 
respectively. As T is a longest path in T, the portion of T from v { to the leaf 
is a longest path in T\ and of length at most m + 1. By Lemma 6.3, |zi | < 2 m 
(since z\ is the yield of T { ). 

For better understanding, we illustrate the construction for the grammar 
whose productions are S —> AB , A —» aB \ a, B —> bA \ b, as in Fig. 6.13. In 
the figure, 

F = 5 —^ A —)■ B —> A —^ B —^ b 

z = ababb, Z\ - bab , w = b 
v = ba , v = A, w = a , y = b 



Chapter 6: Context-Free Languages B 215 


As s and z\ are the yields of T and a proper subtree T x of 7, we can write 
z = uz\y * As z\ and w are the yields of T\ and a proper subtree T 2 of T u we 
can write z\ = vwx. Also, | vwx | > | w \. So, | vx | > 1. Thus, we have z = uvwxy 
with | vwx | < n and j vx| > 1. This proves the points (i)-(iii) of the theorem. 

As T is an 5-tree and T h T 2 are 5-trees, we get S =b> uBy\ B => vBx and 
B => w. As S => uBy =e> mvy , zzv 0 wx°y e L. For fc > 1, 5 => w5v => uv k B^y 
zzvSvx*}’ e L. This proves the point (iv) of the theorem. I 



Fig. 6.13 Tree T and its subtrees 7^ and T 2 . 


Corollary Let L be a context-free language and n be the natural number 
obtained by using the pumping lemma. Then (i) L ^ 0 if and only if there 
exists w e L with | w | < n, and (ii) L is infinite if and only if there exists 
z e L such that n < M < 2 n. 
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Proof (i) We have to prove the ‘only if part. If z e L with \z\ > n, we 
apply the pumping lemma to write e = uvwxy , where 1 < |vjc| < /?. Also, 
uwy e L and | awy | < | c |. Applying the pumping lemma repeatedly, we can 
get f e L such that | -'| < n. Thus (i) is proved. 

(ii) If z e L such that n < | - [ < In. by pumping lemma we can write 
- = uvwxy. Also. uv k wx k y e L for all k > 0. Thus we get an infinite number 
of elements in L. Conversely, if L is infinite, we can find z e L with \z \ > n. 
If | -1 < 2/1, there is nothing to prove. Otherwise, we can apply the pumping 
lemma to write £ = uvwxy and get uwy e L Every time we apply the pumping 
lemma we get a smaller string and the decrease in length is at most n (being 
equal to j vx\). So, we ultimately get a string z in L such that n < | z | < 2 n. 
This proves (ii). I 

Note: As the proof of the corollary depends only on the length of vx, we can 
apply the corollary to regular sets as well (refer to pumping lemma for regular 
sets). 

The corollary given above provides us algorithms to test whether a given 
context-free language is empty or infinite. But these algorithms are not efficient. 
We shall give some other algorithms in Section 6 . 6 . 

We use the pumping lemma to show that a language L is not a context- 
free language. We assume that L is context-free. By applying the pumping 
lemma we get a contradiction. 

The procedure can be carried out by using the following steps: 

Step 1 Assume L is context-free. Let n be the natural number obtained by 
using the pumping lemma. 

Step 2 Choose z e L so that \z\ > n. Write c = uvwxy using the pumping 
lemma. 

Step 3 Find a suitable k so that uv k wx k y <£ L. This is a contradiction, and so 
L is not context-free. 


EXAMPLE 6.18 


Show that L = {a n b u c ]1 \n > 1} is not context-free but context-sensitive. 

Solution 

We have already constructed a context-sensitive grammar G generating L (see 
Example 4.11). We note that in every string of L, any symbol appears the 
same number of times as any other symbol. Also a cannot appear after b , and 
c cannot appear before b . and so on. 

Step 1 Assume L is context-free. Let n be the natural number obtained by 
using the pumping lemma. 

Step 2 Let - = d l b n c n . Then | -| = 3n > n. Write z = uvwxy, where \vx\ > 1, 
i.e. at least one of v or x is not A. 
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Step 3 uvwxy- = ci n b n c n . As 1 < | vx\ < n, v or x cannot contain all the three 
symbols cl b. c. So, (i) v or x is of the form dd (or b l d) for some ?, j such that 
i + j < n. Or (ii) v or x is a string formed by the repetition of only one symbol 
among a. b , c. 

When v or x is of the form did, v 2 = did did (or x 2 - did did ). As v 2 is a 
substring of uv\vx 2 y. we cannot have uv 2 wx 2 y of the form d n b m c m . So, 
z/vurrv £ L. 

When both v and x are formed by the repetition of a single symbol (e.g. 
u - d and v = V for some i and j\ i < n,j < ri), the string uwy will contain the 
remaining symbol, say ci\. Also, af will be a substring of uwy as does not 
occur in v or .v. The number of occurrences of one of the other two symbols 
in uwy is less than n (recall uvwxy - d'b n c n ), and n is the number of occurrences 
of flj. So uv 0 wx°y ~ uwy £ L. 

Thus for any choice of v or x, we get a contradiction. Therefore, L is not 
context-free. 


EXAMPLE 6.19 


Show that L - {cf\p is a prime} is not a context-free language. 

Solution 

We use the following property of L: If vv e L, then |w| is a prime. 

Step 1 Suppose L = L(G) is context-free. Let n be the natural number 
obtained by using the pumping lemma. 

Step 2 Let p be a prime number greater than n. Then z ~ a p e L We write 
~ = uvM'xy. 

Step 3 By pumping lemma, uv°wx°y - uwy € L. So | uwy \ is a prime 
number, say q. Let | v,r | = r. Then, | uv q wx q y | = q + qr. As q + qr is not a 
prime, uv q wx q y £ L. This is a contradiction. Therefore, L is not context-free. 


6.6 DECISION ALGORITHMS FOR CONTEXT-FREE 
LANGUAGES 

In this section we give some decision algorithms for context-free languages and 
regular sets. 

(i) Algorithm for deciding whether a context-free language L is empty. 
We can apply the construction given in Theorem 6.3 for getting 
Lv = W k . L is nonempty if and only if S e W k . 

' ii) Algorithm for deciding whether a context-free language L is finite. 
Construct a non-redundant context-free grammar G in CNF generating 
L - {A}. We draw a directed graph whose vertices are variables in 
G. If A —» BC is a production, there are directed edges from A to B 
and A to C. L is finite if and only if the directed graph has no cycles. 
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(iii) Algorithm for deciding whether a regular language L is empty. 
Construct a deterministic finite automaton M accepting L. We construct 
the set of all states reachable from the initial state q 0 . We find the 
states which are reachable from q 0 by applying a single input symbol. 
These states are arranged as a row under columns corresponding to 
every input symbol. The construction is repeated for every state 
appearing in an earlier row. The construction terminates in a finite 
number of steps. If a final state appears in this tabular column, then 
L is nonempty. (Actually, we can terminate the construction as soon 
as some final state is obtained in the tabular column.) Otherwise, L 
is empty. 

(iv) Algorithm for deciding whether a regular language L is infinite. 
Construct a deterministic finite automaton M accepting L. L is infinite 
if and only if M has a cycle. 


6.7 SUPPLEMENTARY EXAMPLES 


EXAMPLE 6.20 


Consider a context-free grammar G with the following productions, 

S -4 ASA | B 
B —> aCb | bCa 
C"-> ACA | A 
A a | b 

and answer the following questions: 

(a) What are the variables and terminals of G? 

(b) Give three strings of length 7 in L(G). 

(c) Are the following strings in L(G)? 

(i) aaa (ii) bbb (iii) aba (iv) abb 

(d) True or false: C => bab 

(e) True or false: C => bab 

(f) True or false: C abab 

(g) True or false: C => AAA 

(h) Is A in 1(G)? 

Solution 

(a) y v = {5, A. B. C } and I = {a, bj 

(b) S => A 2 SA 2 => A 2 BA 2 => A~aCbA 2 => A 2 aAbA 2 ababbab 
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So ababbab e L(G). 

S => A 2 aAbA 2 (as in the derivation of the first string) 

=> aaaabaa 

S => A 2 aAbA 2 => bbabbbb 

So ababbab , aaaabaa , bbabbbb are in L(G). 

(c) S —» ASA. If 5 =b> w, then w starts with a and ends in £> or vice versa 
and | w [ > 3. If aaa is in L(G), then the first two steps in the 
derivation of aaa should be S => ASA => AFA or S => aita. The 
length of the terminal string thus derived is of length 5 or more. 
Hence aaa £ L(G). A similar argument shows that bbb <£ L(G ). 

S => B => acb =» flAfo So e L(G) 

(d) False, since the single-step derivations starting with C can only be 
C => ACA or C => A. 

(e) C => ACA => AAA =?> feafc. True 

(f) Let vi' = abab If C => w, then C => ACA w or C => A => w. 
In the first case | w | = 3, 5. 7. . . . . As | w | = 4, and A => w if and 
only if w = a or b, the second case does not arise. Hence (f) is false. 

(g) C ^ ACA => AAA. Hence C AAA is true. 

(h) A e L(G). 


EXAMPLE 6.21 


If G consists of the productions S —» aSa | Z?S£ | aSb \ bSa | A, show that L(G) 
is a regular set. 

Solution 

First of all, we show that L(G) consists of the set L of all strings over {a, b }, 
of even length. It is easy to see that L(G) c L. Consider a string w of even 
length. Then, w = a x a 2 . . . a 2n _ x a 2n where each a { is either a or b . Hence 

S = £‘ a\Sa 2}1 ^1^2 * ■ - ^22^^224-1 * • ■ ^2/2 $i<2 2 * • * ^2/7* 

Hence L c L(G). 

Next we prove that L = L(Gi) for some regular grammar G x . Define 
G } = ({5, Si, S 2; S 3 , S 4 }, {a, &}, F, S) where P consists of S —> aS x , S x —» aS, 
S -> aS 2 , S 2 -> bS, S -> W 3 , S 3 -> feS, S &S 4 , S 4 -> aS, S -4 A. 

Then S #ia 2 S where a x = a or b and a 2 = a or b. It is easy to see that 
L(Gj) = L. As Gi is regular, L(G) = L(G0 is a regular set. 


EXAMPLE 6.22 


Reduce the following grammar to CNF: 

S —^ ASA j £>A, A —^ 5 j S, B —^ c 
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Solution 

Step 1 Elimination of unit productions: 

The unit productions are A —» B, A —» S. 

Wb(5)= {S}, W,(5) = {5} u 0 = {5} 

W 0 (A) = {A}, W\(A) = {A} u {5, B} = {5, A, 5} 

W 2 (A)= {5, A, 5} u 0 = {S, A, 5} 

W 0 (B)= {£}, fl^B) = {£} u 0 = {£} 

The productions for the equivalent grammar without unit productions are 

S —^ ASA | bA, B —^ c 

A —> ASA | bA, A c 

So, Gi - ({S, A, B}, {&, c}, P, S) where P consists of S -» ASA | M. 
5 —> c, A —^ ASA j bA j c. 

Step 2 Elimination of terminals in R.H.S.: 

S ASA. B —» c, A —> ASA j c are in proper form. We have to modify 
S —» M and A -> M. 

Replace S M by S —> C /y 4, Q, —> £> and A M by A -» QA, 
C /? —» b. 

So, G 2 = ({S, A. 5, Q}, {f>, c}, P 2 , 5) where P 2 consists of 

S -> ASA | CftA 
A —> ASA [ c | CfoA 
B —> c, C^ —^ Z? 

Step 3 Restricting the number of variables on R.H.S.: 

S —> ASA is replaced by S —» AD, D -» SA 
A —» ASA is replaced by A —> AD, E SA 
So the equivalent grammar in CNF is 

G 3 = ({S. A, B, Q, D, £}, {6, c}, P 3 , S) 
where P 3 consists of 

S-> C }y A | AD 
A c\C ly A\AE 

B —^ c, C h —^ £?, D —^ SA, D —^ SA 


EXAMPLE 6.23 


Let G = (V^, S, P. S) be a context-free grammar without null productions or 
unit productions and k be the maximum number of symbols on the R.H.S. of 
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any production of G. Show that there exists an equivalent grammar G x in 
CNF, which has at most (k - l)jP| + |Zj productions. 

Solution 

In step 2 (Theorem 6 . 8 ), a production of the form A X x X 2 ... X n is 
replaced by A -> Y x Y 2t . . ., Y n where Y { = X { if X e V N and Y t is a new 
variable if X i e Z. We also add productions of the form Y { —> X) whenever 
X; e I. As there are | Z | terminals, we have a maximum of | Z j productions of 
the form Y i —> X { to be added to the new grammar. In step 3 (Theorem 6 . 8 ), 
A —> A { A 2 . . . A n is replaced by n - 1 productions, A —> A X D X , D x —> A 2 D 2 
. . . D n _ 2 —> A n ~\A n . Note that n < k. So the total number of new productions 
obtained in step 3, is at most (k- 1) | P |. Thus the total number of productions 
in CNF is at most (k - 1)|P| + |Z|. 


Example 6.24 


Reduce the following CFG to GNF: 

S —> ABb\a, A —» aaA , B —» bAb 

Solution 

The vahd productions for a grammar in GNF are A -» cia, where a e Z, 
a € V N . 

So, S —» APZ? can be replaced by 5 APC, C —> b. 

A —> tzaA can be replaced by A -» ^Z)A, D a. 

B Z?A£> can be replaced by B MC. C b. 

So the revised productions are: 

S —> APC j cz, A —> ciDA , P —^ bAC . C —> b, D —> 

Name S, A, 5, C, D as A h Ay A 3 , A 4 , A 5 . 

Now we proceed to step 2. 

Step 2 G\ = ({At, A 2 , A 3 , A 4 , A 5 }, {a, Z?}, P 1? Aj) where Pj consists of 

A| ^ A 2 A 3 A 4 J ci , A 2 —^ 6 fA^A 2 , A 3 —^ Z?AaA 4 , A 4 Zx A 5 —^ z? 

The only production to be modified using step 4 (refer to Theorem 6.9) 
is A| —^ A 2 A 3 A 4 . 

Replace A\ —) A 2 A 3 A 4 by A j —^ aA 3 A 2 A 3 A 4 . 

The required grammar in GNF is 

G 2 = ({A b Ay Ay A 4 , A 5 }, {a, Z?}, P 2 , ^ 1 ) where P 2 consists of 
A] —^ ^A5 Av4 4 J <2 
A 2 —^ 6 eAsA 2 
A^ —> Z?A 2 A 4 , 


A 4 —^ Z?, A 5 —^ CL 
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EXAMPLE 6.25 


If a context-free grammar is defined by the productions 

S —» a | Sa | bSS | SSb \ SbS 

show that every string in L(G ) has more a' s than V s. 

Proof We prove the result by induction on | w |, where h ; € L(G). 

When | w | - 1 . then w = a. So there is basis for induction. 

Assume that S => h\ \ w\ < n implies that w has more a' s than V s. Let 
\w\ = n > 1. Then the first step in the derivation S A w is 5 bSS or 
S => SSh or 5 =» S£S. In the first case, S => bSS bw x w 2 = w for some 
nq, e X* and 5 => uq, 5 => w 2 . By induction hypothesis each of wq and 
hs has more a’s than b’ s. So wqws has at least two more as than fe’s. Hence 
b\V\W 2 has more as than b’s. The other two cases are similar. By the principle 
of induction, the result is true for all w s L(G). 


EXAMPLE 6.26 


Show that a CFG G with productions 5 —> 55 | (5) | A is ambiguous. 


Solution 


Also. 


5 => SS => 5(5) => A(S) => A(A) = (A) 
5 => SS => (5)5 => (A)5 => (A)A = (A) 


Hence G is ambiguous. 


EXAMPLE 6.27 


Is it possible for a regular grammar to be ambiguous? 

Solution 

Let G = (L v , X, P, S) be regular. Then ever} 7 production is of the form 
A —> aB or A —» b. Let w e L(G). Let S => w be a leftmost derivation. We 
prove that any leftmost derivation A => w\ for every A e V v is unique by 
induction on | w |. If | w | = 1, then w = a e P. The only production is A —> a. 
Hence there is basis for induction. Assume that any leftmost derivation of the 
form A => w is unique when \w\ = n - 1. Let | w | = /? and A => w be a 
leftmost derivation. 

Take w = a\v h a e T. Then the first step of A ==> w has to be A :=> aB 
for some B e Vy. Hence the leftmost derivation A w can be split into 
4 &P ==> aw\. So. we get a leftmost derivation B => wq. By induction 
hypothesis, B =S> wq is unique. So. we get a unique leftmost derivation of w. 
Hence a regular grammar cannot be ambiguous. 
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SELF-TEST 

1. Consider the grammer C which has the productions 
A —> a | Aa | bAA \ AAb | AbA 
and answer the following questions: 

(a) What is the start symbol of G? 

(b) Is aaabb in L(G)? 

(c) Is aaaabb in L(G)? 

(d) Show that abb is not in L(G). 

(e) Write the labels of the nodes of the following derivation tree T 
which are not labelled. It is given that T is the derivation tree 
whose yield is in {a, £}*. 



Fig. 6.14 Derivation tree for Question 1(e). 


2. Consider the grammar G which has the following productions 
S aB\bA , A -> aS \ bAA | a, B -> bS\aBB\b. 

and state whether the following statements are true or false. 

(a) L(G) is finite. 

(b) abbbaa 6 L{G) 

(c) aab g L(G) 

(d) L(G) has some strings of odd length. 

(e) L(G) has some strings of even length. 
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3. State whether the following statements are true or false. 

(a) A regular language is context-free. 

(b) There exist context-free languages that are not regular. 

(c) The class of context-free languages is closed under union. 

(d) The class of context-free languages is closed under intersection. 

(e) The class of context-free languages is closed under 
complementation. 

(f) Every finite subset of {a, Z?}* is a context-free language. 

(g) {d l b n c u \n > 1} is a context-free language. 

(h) Any derivation tree for a regular grammar is a binary tree. 

EXERCISES 

6.1 Find a derivation tree of a * b + a * b given that a * b + a * b is in 
L(G ), where G is given by S —> S + S\ S * 5, 5 —> a | b. 

6.2 A context-free grammar G has the following productions: 

S -> 0501 1511 A, A 2B3. B -> 2^3 | 3 
Describe the language generated by the parameters. 

6.3 A derivation tree of a sentential form of a grammar G is given in 
Fig. 6.15. 





(a) What symbols are necessarily in V N 7 

(b) What symbols are likely to be in 2? 

(c) Determine if the following strings are sentential forms: (i) X 4 X 2 , 
(ii) X 2 X 3 X 3 X 2 X 3 X 3 , and (iii) A 2 A 4 X 4 Z 2 . 

6.4 Find (i) a leftmost derivation, (ii) a rightmost derivation, and (iii) a 
derivation which is neither leftmost nor rightmost of ababciba , 
given that ababciba is in L(G), where G is the grammar given in 
Example 6.4. 
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6.5 Consider the following productions: 

S —> ciB j bA 
A —) ciS | bAA | a 
B bS\aBB\b 

For the string aaabbabbba , find 

(a) the leftmost derivation. 

(b) the rightmost derivation, and 

(c) the parse tree. 

6.6 Show that the grammar S —» a \ abSb | aAb , A —> bS | aAAb is ambiguous. 

6.7 Show that the grammar S —> aB \ab, A —> ciAB | a. B —» ABb | b is 

ambiguous. 

6.8 Show that if we apply Theorem 6.4 first and then Theorem 6.3 to a 
grammar G, we may not get a reduced grammar. 

6.9 Find a reduced grammar equivalent to the grammar S —> aAa , A -» 
bBB , B -> ab, C -> aB. 

6.10 Given the grammar 5 —» AB . A —> a. B —> C | £>, C —> D, D -» £, 
£ —> < 7 , find an equivalent grammar which is reduced and has no unit 
productions. 

6.11 Show that for getting an equivalent grammar in the most simplified 

form, w 7 e have to eliminate unit productions first and then the 

redundant symbols. 

6.12 Reduce the following grammars to Chomsky normal form: 

(a) S -> 1A | 05, A -> 1AA | 05 | 0, B -> OBB | 15 | 1 

(b) G = ({5}, { a , b, c} 9 {S a\b\ cSS }, S ) 

(c) S —» abSb | a | aAb. A —> bS | aAAb . 

6.13 Reduce the grammars given in Exercises 6.1, 6.2, 6 . 6 , 6.7, 6.9, 6.10 
to Chomsky normal form. 

6.14 Reduce the following grammars to Greibach normal form: 

(a) S SS, 5 0S1 | 01 

(b) S ^ AB. A —> A -> BB. B -> aA£, S -> a, A —> fe 

(c) S A0, A 05. 5 -> A0, i? —» 1 

6.15 Reduce the grammars given in Exercises 6.1, 6.2, 6 . 6 , 6.7, 6.9, 6.10 
to Greibach normal form. 

6.16 Construct the grammars in Chomsky normal form generating the 
following: 

(a) {wcw* | w e 0 {a, £}*}, 

(b) the set of all strings over {cl b) consisting of equal number of a 's 
and V s, 
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(c) { a m b n | m * n, m, n > 1 }. and 

(d) {a n b m c n \m, n > 1 }, 

6.17 Construct grammars in Greibach normal form generating the sets given 
in Exercise 6.16. 

6.18 If w e L(G) and | w | = k , where G is in (i) Chomsky normal form, 
(ii) Greibach normal form, what can you say about the number of steps 
in the derivation of w? 

6.19 Show that the language {a n ~ \ n > 1} is not context-free. 

6.20 Show that the following are not context-free languages: 

(a) The set of all strings over {a, b , c} in which the number of 
occurrences of a , b , c is the same. 

(b) {a m b m c 11 \ m < n < 2m}. 

(c) {a m b n | n = m 2 }. 

6.21 A context-free grammar G is called a right-linear grammar if each 
production is of the form A —» wB or A —> tv, where A, B are variables 
and tv € Z*. (G is said to be left-linear if the productions are of the 
form A —» Bw or A -» tv. G is linear if the productions are of the form 
A —> vBw or A —> tv.) Prove the following: 

(a) A right-linear or left-linear grammar is equivalent to a regular 
grammar. 

(b) A linear grammar is not necessarily equivalent to a regular 
grammar. 

6.22 A context-free grammar G is said to be self-embedding if there exists 
some useful variable A such that A A> wAv, where u, v e X*, u, 
v ^ A. Show that a context-free language is regular iff it is generated 
by a nonselfembedding grammar, 

6.23 Show that ever}' context-free language without A is generated by a 
context-free grammar in which all productions are of the form A —> a, 
A —» accb . 



Pushdown Automata 



In this chapter we introduce pushdown automaton (pda). We discuss two types 
of acceptance of sets by pushdown automata. Finally* we prove that the sets 
accepted by pushdown automata are precisely the class of context-free 
languages. 

7.1 BASIC DEFINITIONS 

We have seen that the regular languages are precisely those accepted by finite 
automata. If M is a finite automaton accepting L, it is constructed in such a way 
that states act as a form of primitive memory. The states ‘remember the 
variables encountered in the course of derivation of a string. (In M, the states 
correspond to variables.) Let us consider L = {a n b u \n > 1}. This is a context- 
free language but not regular. (S —» aSb | ab generates L. Using the pumping 
lemma we can show that L is not regular; cf. Example 5.20.) 

A finite automaton cannot accept L, i.e. strings of the form a n b } \ as it has 
to remember the number of a s in a string and so it will require an infinite 
number of states. This difficulty can be avoided by adding an auxiliary 
memory in the form of a 'stack’ (In a stack we add the elements in a linear 
way. While removing the elements we follow' the last-in-first-out (LIFO) 
basis, i.e. the most recently added element is removed first.) The a's in the 
given string are added to the stack. When the symbol b is encountered in the 
input string, an a is removed from the stack. Thus the matching of number 
of c s and the number of b 's is accomplished. This type of arrangement where 
a finite automaton has a stack leads to the generation of a pushdown 
automaton. 

Before giving the rigorous definition, let us consider the components of a 
pushdown automaton and the w ; ay it operates. It has a read-only input tape, 
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an input alphabet, a finite state control, a set of final states, and an initial state 
as in the case of an FA. In addition to these, it has a stack called the pushdown 
store (abbreviated PDS). It is a read-write pushdown store as we add elements 
to PDS or remove elements from PDS. A finite automaton is in some state 
and on reading, an input symbol moves to a new state. The pushdown 
automaton is also in some state and on reading an input symbol and the 
topmost symbol in PDS, it moves to a new state and writes (adds) a string of 
symbols in PDS. Figure 7.1 illustrates the pushdown automaton. 

We now give a formal definition of a pushdown automaton. 


r 

E 

: 



hinite store 


Z 

< 

Storing 

control 




direction 


Removing 

direction 


Pushdown store 


Fig. 7.1 Model of a pushdown automaton. 


Definition 7.1 A pushdown automaton consists of 

(i) a finite nonempty set of states denoted by Q, 

(ii) a finite nonempty set of input symbols denoted by X, 

(iii) a finite nonempty set of pushdown symbols denoted by T, 

(iv) a special state called the initial state denoted by q Q , 

(v) a special pushdown symbol called the initial symbol on the pushdown 
store denoted by Zq, 

(vi) a set of final states, a subset of Q denoted by F, and 

(vii) a transition function 8 from < 2 x(Su {A}) x F to the set of finite 
subsets of Q x r*. 

Symbolically, a pda is a 7-tuple, namely (Q, X, T, 5, q 0 < Z 0 , F). 

Note: When S(q. a. Z) = 0 for (q, a. Z) e Q x (X u {A}) x F, we do not 
mention it. 


EXAMPLE 7.1 


Let 

A = (Q, 2, T. <5. < 7 o, Zq. F) 

where 

Q = {<7o> <7i, 0 /}, I = {a, b). F = {a. Zq}, F = {q t } 
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and <5 is given by 

8(q 0 , a, Z 0 ) = {(<7o. flZo)}, 8(q h b, a) = {(< 71 . A)} 

<5(<7o, a) = {(< 7 0 , aa)}, 8(q h A, Z 0 ) = {(q h A)} 

< 5 (< 7 0 , b. a)= {(< 7 ,, A)} 

Remarks 1. 5(< 7 , a, Z) is a finite subset of Q x r*. The elements of 
<5(g, a. Z) are of the form (</'. a), where q e £), a € T*. c5(c/, a. Z) may 
be the empty set. 

2. At any time the pda is in some state q and the PDS has some symbols 
from T. The pda reads an input symbol a and the topmost symbol Z in PDS. 
Using the transition function <5, the pda makes a transition to a state q and 
writes a string a after removing Z. The elements in PDS which were below' 
Z initially are not disturbed. Here (q\ a) is one of the elements of the finite 
set 8(q, a . Z). When a = A, the topmost symbol, Z. is erased. 

3. The behaviour of a pda is nondeterministic as the transition is given 
by any element of 8{q . cl Z). 

4. As 8 is defined on Q x (X u {A}) x F, the pda may make transition 
without reading any input symbol (wTien 8{q , A, Z) is defined as a nonempty 
set for q e Q and Z e T). Such transitions are called A-moves. 

5. The pda cannot take a transition w'hen PDS is empty (We can apply 
8 only when the pda reads an input symbol and the topmost pushdown symbol 
in PDS). In this case the pda halts. 

6 . When we write a = Z{Z 2 . .. Z m in PDS. Z x is the topmost element, 
Z 2 is below' Zj. etc. and Z m is below 7 Z m _ j. 

In the case of finite automaton, it is enough to specify the current state at 
any time and the remaining input string to be processed. But as we have the 
additional structure, namely the PDS in pda, w 7 e have to specify the current 
state, the remaining input string to be processed, and the symbols in the PDS. 
This leads us to the next definition. 

Definition 7.2 Let A = ( Q , Z, T, 5, q$. Z 0 , F) be a pda. An instantaneous 
description (ID) is (q, a, a), where q e Q, x e X* and a e T*. 

For example, ( q , a } a 2 . . . a }V Z{Z 2 . . . Z m ) is an ID. This describes the 
pda w'hen the current state is q , the input string to be processed is a x a 2 . . . a rr 
The pda will process ci\ci 2 . . . a n in that order. The PDS has Z b Z 2 , . . ., Z m 
with Z] at the top. Z 2 is the second element from the top, etc. and Z m is the 
low'est element in PDS. 

Definition 7.3 An initial ID is ( q 0 , a, Z 0 ). This means that initially the pda 
is in the initial state c/ 0 . the input string to be processed is a, and the PDS has 
only one symbol, namely Z 0 . 

Note: In an ID (cj< a. a), x may be A. In this case the pda makes a A-move. 
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For a finite automaton, the working can be described in terms of change 
of states. In the case of pda. because of its additional structure, namely PDS, 
the working can be described in terms of change of IDs. So we have the 
following definition: 

Definition 7.4 Let A be a pda. A move relation, denoted by |—, between IDs 
is defined as 

CliCl2 . . . Cl fr Z[Z 2 • • * Z m ) j (q\ ClnCl^ . . - Cl tv fiZ2 . • • Z fU ) 

if o(q , a b Zj) contains (q\ p). 

Note: The move relation 

(q-. «!«2 • • • z i z : . • • Z,„) |— (q\ a 2 a 3 . . . a,„ j3z 2 ... Z m ) 

can be described as follows: The pda in state q with Z.Z 2 . . . Z m in PDS 
(Zj is at the top) reads the input symbol a x . When ( q\ p) e S(q , a x , Zj), the 
pda moves to a state q and writes ft on the top of Z 2 . .. Z nr After this 
transition, the input string to be processed is # 2^3 • • • a ?r 

If p = Y x Y 2 ... Y^ then Fig. 7.2 illustrates the move relation. 



Fig. 7.2 An illustration of the move relation. 


Remark As |— defines a relation in the set of all IDs of a pda, we can define 
the reflexive-transitive closure p— which represents a definite sequence of n 
moves, where n is any non-negative integer. 

If (q, x. a ) p~ (q\ y\ p) represents n moves, we write (q, x, a ) |JL 
(q\ v, p). In particular, (< 7 , x, a) [2L (< 7 , x, a). Also, (q, x, a) (q\ y, p) 
can be split as 

07 » x '. a ) | (< 7 i* -Vj, #1) |— (cj2, x'2, CC2) |— ■ . . |— (q\ y\ P ) 

for some x b x 2 , . . ., e X*. a u a 2 , . . ., € T* 

iVote: When we deal with more than one pda, we also specify the pda while 
describing the move relation. For example, a move relation in A is denoted 

by hr • 
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The next two results are properties of the relation \-~ and are frequently 
used in constructions and proofs. 

Result 1 If 

(q h x , a) p- A, /3) (7.1) 

then for every y e 

xv’. a) p_ (^ 2 , y, p) (7.2) 

Conversely, if at. a) p- (<7 2 , y, P) for some v g E*, then (q h x, a ) p~ 
(</ 2 , A, £). 

Proof The result can be easily understood once we refer to Fig. 7.2, 
providing an illustration of the move relation. 

If the pda is in state q { with a in PDS. and the moves given by (7.1) are 
effected by processing the string x, the pda moves to state q 2 with p in PDS. 
The same transition is effected by starting with the input string xy and 
processing only x. In this case, v remains to be processed and hence we 
get (7.2). 

We can prove the converse part, i.e. (7.2) implies (7.1) in a similar way. 


Result 2 If 

then for every’ /3 e F*. 


(cj. x, oc) p- (q . A, y) 


(<7* x. aft) p- (q\ A, yp) 


Proof The sequence of moves given by (7.3) can be split as 


(7.3) 

(7.4) 


(q, x. a) |— (< 7 i, X]. oq) |— (q 2 > xy oc 2 ) j— * * ■ f— (q'« A, y) 

Consider (q r x h a } ) \— (q j+Xt x /+1 , a M ). Let a, - Z{Z 2 ... Z m . As a result 
of this move, Z\ is erased and some string is placed above Z 2 . . . Z m . So, 
Z 2 . . . Z m is not affected. If we have P below Z 2 . . . Z m . then also 
Z 2 . . . Z m P is not affected. So we obtain ( q h x h a f p) )— (qj+\, x i+h oc i+] p). 
Therefore, we get a sequence of moves 


(q, x, OCp) |— (qi- Xj, CCiP) |— . • . |— (q\ A, y/3) 
i.e. 

{q, x, ap) p- (r/. A. yp) \ 

Note : In general. (7.4) need not imply (7.3). Consider, for instance, 


A = ({<7 0 }- {<*, bh {Zb}, 5, r /o . Z 0 , 0) 


S(q 0 . a , Z 0 ) - {(<7o* A)}. <S(<7 0 , b.. Z 0 ) = {(<7 0 , ZoZ 0 )} 

(< 7 0 . ZqZqZqZq) 

j — (<7o> ^ ZqZqZq) 

|— 0?o ZoZo) 

|— (C/o- A, ZqZqZq) 


where 
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i.e. 

(< 7 o aab , ZqZqZqZq) p- (#o, A, ZqZqZq) 

However, (g 0 - aai, Zq) |— (g 0 - a b. A); hence the pda cannot make any more 
transitions as the PDS is empty. This shows that (7.4) does not imply (7.3) 
if we assume a = ZqZqZq. /3 = Zq, y = Zr>Z 0 . 


EXAMPLE 7.2 


'o’ 

O 

So¬ 

il 

{a, b, c}, {a, b. Zq}, 8. q () , Zq, {q t }) 


is a pda, where <5 is defined 

as 


<5(<7q, a. Z 0 ) = {(<7o, ciZq )}. 

8(q 0 , b, Z 0 ) = {(< 7 0 . bZQ} 

(7.5) 

§(ci Q ,. a. a) = {(q 0 , aa)j , 

8(q 0 , b, a) = {( q 0 , ba )} 

(7.6) 

< 5 ((? 0 ’ a - A> = {(<7o- ab )}’ 

8(q 0 , b, b) = {(go, bb)} 

(7.7) 

S(q Q , c , a)= {(q h a)}, 

S(q 0 . c. b) = {(< 7 i, b)}, 8(q 0 , c, ZQ 



II 

£ 

(7.8) 

< 5 (< 7 i- a, a)= 8{q { . b. b) ■ 

= {(?„ A)} 

(7.9) 

8{q x . A. Z 0 ) = {{q f . Z 0 )j 


(7-10) 


We can explain 5 as follows: 

If A is in initial ID. then using Rule (7.5). A pushes the first symbol of 
the input string on PDS if it is a or b. By Rules (7.6) and (7.7), the symbols 
of the input string are pushed on PDS until it sees the centre-marker c. By 
Rule (7.8). on seeing c. the pda moves to state c\\ without making any changes 
in PDS. By Rule (7.9). the pda erases the topmost symbol if it coincides with 
the current input symbol (i.e. if they do not match, the pda halts). By Rule 
(7.10). the pda reaches the final state q f only when the input string is 
exhausted, and then the PDS has only Zq. 

We can explain the concepts of ID. moves, etc. for this pda A. Suppose 
the input string is acab. We will see how the pda processes this string. An 
initial configuration is (q 0 , bacab , Zq). We get the following moves: 


(q 0 . bacab , Zq) | — (q 0 , acab , Z?Z 0 ) 
|— (q 0 , cab, abZo) 

|— (q { . ab . abZQ) 

|— (qo b, bZo) 

|— (q { , A, Zq) 

|— (q f . A. Zq) 


by Rule (7.5) 
by Rule (7.7) 
by Rule (7.8) 
by Rule (7.9) 
by Rule (7.10) 
by Rule (7.10) 


i.e. 


(q 0 , bacab. Zq) (q fi Zq) 
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Proceeding in a similar way, we can show that 

(q 0 , wcw T . Zq) p- (q^ A, Zq) for all w e {a, b}* 
Suppose an initial configuration is ( q 0 , abcbb , Z 0 ). Then, we have 

(< 7 o abcbb. Z 0 ) j— (< 7 0 , tfZ 0 ) by Rule (7.5) 

j— (^o, ebb, baZo) by Rule (7.6) 

|— (q { . bb, baZo ) by Rule (7.8) 

|— (q h b. tfZ 0 ) by Rule (7.9) 


Once the pda is in ID (■ q h b , oZ 0 ), it has to halt as S(q u b. a) = 0. Hence, 
we have 


(q 0 , abcbb. Z 0 ) p~ (q { , b, aZ 0 ) 

As 5(^0' c. Zq) = 0, the pda cannot make any transition if it starts with an 
ID of the form ( q 0 , c\v. Zq). 

Ahte: In Example 7.2. each a. Z) is either empty or consists of a single 
element. So for making transitions, the pda has only one choice and the 
behaviour is deterministic. 


In general, a deterministic pda can be defined as follows: 

Definition 7.5 A pda A = (Q. I. T. 5. q 0 . Z 0 , F) is deterministic if 
(i) S(q, a, Z) is either empty or a singleton, and (ii) 8(q. A. Z) ^ 0 implies 
S(q , a, Z) = 0 for each a e Z. 

Consider the pda given in Example 7.2. 8(q. a. Z) given by Rules 
(7.5)-(7.10) are singletons. Also. 8{q\, a. Z 0 ) = 0 and 8{q\. a, Zq) = 0 for 
all a e X. So the pda given in Example 7.2 is deterministic. 


7.2 ACCEPTANCE BY pda 

A pda has final states like a nondeterministic finite automaton and has also the 
additional structure, namely PDS. So we can define acceptance of input strings 
by pda in terms of final states or in terms of PDS. 

Definition 7.6 Let A = (Q, I, T, <5. q$, Zq, F) be a pda. The set accepted 
by pda by final state is defined by 

T(A) = {w g Z*|07 O . w, Zq) !^- (qf. A, a) for some q f e F and a e f*} 


EXAMPLE 7.3 


Construct a pda A accepting L = {wcw T \w e {a, b}*} by final state. 
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Solution 


Consider the pda given in Example 7.2. Let \vcw T e L. Write w = a { a 2 . . . a iv 
where each a; is either a or b. Then, we have 

(9o- a { a 2 . . . a n cw ', Z 0 ) 

P- (q o, cw r . «„«„_! . . . a{Z q) by Rules (7.5)—(7.7) 

p- (9b • • • «b • ■ • «iZo) by Rule (7.8) 

|^- {q x , A. Z 0 ) by Rule (7.9) 


|—(< 7 /; A. Zq) by Rule (7.10) 

Therefore, wcw r e T(A), i.e. L c T(A). 

To prove the reverse inclusion, it is enough to show that U c T{A)‘\ Let 
x € L c . 


Case 1 x does not have the symbol c. In this case the pda never makes a 
transition to q x . So the pda cannot make a transition to q f as we cannot apply 
Rule (7.10). Thus, x e T(Af. 

Case 2 

X = H'jCWb. \v 2 ^ w{ 

(9o- w'jcwb. Zq) 

H- (4o> cvv 2 , npZo) 

|— (9b H b- w '\ Zo) 

As vv 2 ^ h’] 7 *, the pda cannot reach an ID of the form (q\, A, Z 0 ). So we 
cannot apply (7.10). Therefore, x e T(A) r . 

Thus we have proved L c c T(A) C . 

The next definition describes the second type of acceptance. 

Definition 7.7 Let A = (Q . I. T. 5. < 70 * Z 0 , F) be a pda. The set N(A) 
accepted by null store (or empty store) is defined by 

N(A) = {w e 2*|(^o? vr. Zq) (q . A, A) for some <7 e £)} 

In other words, w is in A(A) if A is in initial ID (q 0 , w, Zq) and empties 
the PDS after processing all the symbols of w. So in defining N(A), we 
consider the change brought about on PDS by application of w, and not the 
transition of states. 


EXAMPLE 7.4 


Consider the pda A given by Example 7.2 with an additional rule: 

8(q f . A. Zb) = {{qf, A)} 


N(A) = (im/1 w e {a. b } *} 


Then. 


(7.11) 
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Solution 

From the construction of A, we see that the Rules (7.5)—(7.10) cannot erase 
Z 0 . We make a provision for erasing Z 0 from PDS by Rule (7.11). By 
Example 7.2, wcw T e T(A) if and only if the pda reaches the ID (q f , A, Z 0 ). 
By (7.11), PDS can be emptied by A-moves if and only if the pda reaches the 
ID (cfp A. Z 0 ). Hence. 

N(A) = {wcw T \w e {a , b}*} 

In the next theorem we prove that the set accepted by a pda A by null store 
is accepted by some pda B by final state. 

Theorem 7.1 If A = (Q, I. T, <5, q 0 , Z 0 , F) is a pda accepting L by empty 
store, we can find a pD 

B = (Q\ I, r, S B . q' 0i Zq, F') 

which accepts L by final state, i.e. L - N(A) = 7(5). 

Proo/ 5 is constructed in such a way that (i) by the initial move of 5, it 
reaches an initial ID of A. (ii) by the final move of 5, it reaches its final state, 
and (iii) all intermediate moves of B are as in A. 

Let us define B as follows: 

B = {Q\ I. n, S Bl <7 q. Z'o, F) 

where 

is a new 7 state (not in 0 ). 

F' = with q f as a new state (not in Q ), 

Q' - Qkj {q\y, q f }, 

Z'o is a new stait symbol for PDS of 5, 

r=Tu {Zq}, and 

<5 S is given by the rules R h R 2 , F 3 

with 

Fo 4(9 o- A, Z'o) = {(?o, ZqZ'o)}. 

F:- 5 S (^, a , Z) = < 5 (< 7 , a, Z) for all (r/, a, Z) in Q x (L u {A}) x F 

F 3 : 5 ^( 9 , A, Z'o) = {(^ A)} for all q e Q. 

By F b the pda 5 moves from an initial ID of F to an initial ED of A. 

R\ gives a A-move. As a result of R h B moves to the initial state of A with 

the start symbol Zq on the top of PDS. 

R 2 is used to simulate A. Once B reaches an initial ED of A. R 2 can be used 
to simulate moves of A. We can repeatedly apply R 2 until Z ' 0 is pushed to the 
top of PDS. As Zq is a new pushdown symbol, we have to use R 2 . 

R 2 gives a A-move. Using F 3 . B moves to the new (final) state qj erasing 
Zq in PDS. 

Thus the behaviour of B and A are similar except for the A-moves given 

by R\ and F 3 . Also, vr e T(B) if and only if B reaches q t \ i.e. if and only 
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if the PDS has no symbols from T (since B can reach cjf only by the 


application of /? 3 ). This suggests that T(B ) = N(A). 

Now we prove rigorously that N(A) - T(B ). Suppose w e 
the definition of N(A), (q 0 . w. Zq) 1-4- ( 9 * A, A) for some q 
we see that 

1V(A). Then by 
e Q. Using ft 2 , 

(9o* w, Zq) hi" (<7, A, A) 


By Result 2, 


(9o> 2 q z/ o) hi" (9> A, Z' 0 ) 

(7.12) 

By jRj. 


(9o* A, Zq) hx (9o A, ZqZq) 


By Result 1, we have 


(? o> Z 0 ) |x w? z oZo) 

(7.13) 

By R 3 , 


(9* A. Zq) ^ (9 f , A, A) 

(7.14) 

Combining (7.12)—(7.14), we have 


(9o> z o) hi" (9/* A, A) 


This proves that \v e T(B), i.e. N(A) e r(£). 

To prove T(B) c A r (A), we start with vc c T(B), Then 


( 90 ‘ vv - z 'o) Hr (?/* A, a) 

(7.15) 


But B can reach q f - only by the application of R 3 . To apply Zq should be 
the topmost element on PDS. Z' 0 is placed initially, and so when it is on the 
top there are no other elements in PDS. So a - A, and (7.15) actually reduces 
to 


O/o, M\ Z'q) |-§- (c/f. A, A) (7.16) 

In (7.16), the initial and final steps are effected only by A-moves. The 
intermediate steps are induced by the corresponding moves of A. So (7.16) can 
be split as (q' 0 , Aw. Z'q) ^ (q 0 , vv, Z 0 Z' 0 ) Yj- (q. A, Z'q) for some 
q e Q. Thus. (q' 0 . Aw, Z'q) \-^ ( q 0 , w. ZqZ'q) |-§- (q. A, Z' 0 ) j-g- (q f , A, A). 
As we get (c/ 0 . w. ZqZ'q) j-|- (q, A. Z'q) by applying ft? several times and ft 2 
does not affect Z'q at the bottom, we have (r/ 0 , w, Z 0 ) (q. A, A). By the 

construction of ft ; . we have (<r/ 0 , w. Z 0 ) |4j- (q. A, A), which means w e N(A). 
Thus. T(B) c N(A). and hence T(B) = N(A) = L. I 
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Note: From the construction of B . it is easy to see that 5 is deterministic if 
and only if A is deterministic. 


EXAMPLE 7.5 


Consider the pda A given in Example 7.1 (Take F = 0). Determine N(A). 
Also construct a pda £ such that T(B) = N(A). 

Solution 


A = ({< 7 o, <?i}, {a, b}, {a, Z 0 }, 8. q 0 , Z 0 , 0), 
where <5 is given by 


Ri 

S{q Q , 

a . 

Z 0 ) = 

{(<?£» aZ 0)} 

R z 

<5(<70' 

(7, 

a) = 

{(70- ««)} 

i? 3 

<5(7o- 

b. 

a) = 

{(7b A)} 


<5(7 i- 

b, 

a) = 

{(7b A)} 

R s 

8{q\, 

A, 

Zq) = 

{( 7 b A)} 


R x is used to store a in PDS if it is the first symbol of an input string. 
R 2 can be used repeatedly to store a n in PDS. When b is encountered for the 
first time in the input string, a is erased (in PDS) using Ry Also, the pda 
makes a transition to state q\. After processing the entire input string, if Z 0 
remains in PDS, it can be erased using the null move given by R 5 . So, if 
\v = a ,l b , \ then we have 

(<?q. ci n b n . Zq) p- (go b } \ ci n Zq) by applying R { and R 2 

p- (cji, A. Zq) by applying R ? and R 4 

|— (qi. A, A) by applying R 5 

Therefore. a u b n e N(A). 

If vr e N(A). then (<y 0 , \\\ Z 0 ) p~ (q { , A, A). (Note that the PDS can be 
empty only when A is in state q j.) Also, w should start with a. Otherwise, we 
cannot make any move. We store the symbol a in PDS if the current input 
symbol is a and the topmost symbol in PDS is a or Z 0 . On seeing the input 

symbol b, the pda erases the symbol a in PDS. The pda enters the ID {q h A, A) 

only by the application of R 5 . The pda can reach the ID (q u A, Z 0 ) only 
by erasing the a's in pda. This is possible only when the number of Z?’s is 
equal to number of a s, and so \v = ci n b n . Thus, we have proved that 
N(A) = {a n b n \ji >1}. 

Now let 

B = ( Q'. {a. b}. r. 8b, q' 0 . Z' 0 . F') 

where 


Q' = {<?o> q\, qjh 


F' = {?,}, r = {a. b. Z' 0 j 
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and 8 b is defined by 


3b(9o> 

A. Z'o) = 

’ {(90. 

Z 0 Z 0 )} 

<%(<7b 

a. Z 0 ) = 

: {(<7o> 

aZ 0 )} 

<5b(<?o> a. a) = 

{(<7o> 

aa)} 

S B (q 0 , b, a) = 

: {(<7i> 

A)} 

8 B (q 

1 , a) = 

: {(<7i> 

A)} 

^b(<3 b 

A, Zq) = 

; U<7b 

A)} 

5s(<? 0 ^ 

A, Z'o) = 

= 

A)} 


A, Zq) = 


A)} 


Thus* 

T(B) = N(A) = [a n b n \n > 1} 

The following theorem asserts that the set accepted by a pda A by final 
state is accepted by some pda B by null store. 

Theorem 7.2 If A = (Q, £, F, <5, q 0 , Z 0 , F) accepts L by final state, we can 
find a pda B accepting L by empty store; i.e. L = T(A) - N(B). 

Proof B is constructed from A in such a way that (i) by the initial move of 
B an initial ID of A is reached* (ii) once B reaches an initial ED of A, it 
behaves like A until a final state of A is reached, and (iii) when B reaches a 
final state of A, it guesses whether the input string is exhausted. Then B 
simulates A or it erases all the symbols in PDS. 

The actual construction of B is as follows: 


B = (Q u {tfo, d}. ITu {Z' 0 }, fa q' Q , Z' 0 , 0) 

where r / 0 is a new state (not in Q ), J is a new (dead) state, and Zq is the new 
start symbol for PDS of B . 

(5 S is defined by rules R 2 , and F 4 as 

^i : 0 ?o> A? Zq) ~ {(#0 ^ 0 ^ 0 )} 

R 2 : S B (q , a, Z) = <5(g, a, Z) for all a e X, <7 e g, Z e F 

/? 3 : <5 5 («?, A, Z) = 5 (< 7 , A, Z) u {(d. A)} for all Z e Fu {Z' 0 } and 4 e F 

F 4 : <5 5 (^, A, Z) = {(d, A)} for all Z e T u {Z' 0 } 

Using /?!, £ enters an initial ED of A and the start symbol Zq is placed on top 

of PDS. 

Using R 2 , B can simulate A until it reaches a final state of A. On reaching 
a final state of A. B makes a guess whether the input string is exhausted or 
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not. When the input string is not exhausted, B once again simulates A. 
Otherwise, B enters the dead state d. Rule R 4 gives a A-move. Using these 
A-moves, B erases all the symbols on PDS. 

Now w g T(A) if and only if A reaches a final state. On reaching a final 
state of A, the pda can reach the state d and erase all the symbols in the PDS 
by A-moves. So, it is intuitively clear that w e T(A) if and only if w € N(B). 
We now prove rigorously that T(A ) = N(B). 

Suppose w e T(A). Then for some q e F, a e T*, 

(?o> Zb) bj" (#» A, 

Using we g et 

(?o> ^b) Hr (<?> A > 

Applying Result 2, we obtain 

(<7o> ZqZ'o) |-§- (#, A, oZ'o) 

As 

(q Or A. Z o) |-g- (go ZqZq) 

using Result 1. we get 

0? 0^ z 0 ) \~y (90* w ’ ZqZq) 

From (7.18) and (7.17), we can deduce 

(g'o, w, Z'q) Hr (?, A, aZ'o) (7.19) 

By applying once and repeatedly, we get 

(. q , A, aZ6) hr W. A, A) (7.20) 

Relations (7.19) and (7.20) imply that (q' 0 , w\ Zq) Hr (rf, A, A). Thus we 
have proved T(A) c 7V(5). 

To prove that A r (5) c T(A), we start with w e N(B). This means that for 
some state q of B> 

(90, W'. Zb) Ft (9, A. A) (7.21) 

As the initial move of B can be made only by using R { , the first move of 
(7.21) is (r/o, Aw, Z' 0 ) (go- ZqZ'o). 

Z o in the PDS can be erased only when 5 enters d; B can enter d only 
when it reaches a final state g of A in an earlier step. So (7.21) can be split as 

(go, Aw, Zq) |-g- (qo, w\ ZqZq ) Hr ( q , A. C^Zq) Hr (g, A, A) 


(7.17) 


(7.18) 
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for some q e F and a e T*. But Ojr 0 * vr, ZqZ'q) j-|- (q, A, aZ' 0 ) can be 

obtained only by the application of Ry So the moves involved are those 
induced by the moves of A. As Z' 0 is not a pushdown symbol in A, Z' 0 lying 
at the bottom is not affected by these moves. Hence 

(Vo- Aw, Z' 0 ) |-^ (q. A. a), q € F 
So w e 7(A) and A(S) c T(A). Thus, 

L = N(B) = 7(A) 


EXAMPLE 7.6 


Construct a pda A accepting the set of all strings over {a , b} with equal 
number of ci s and /As. 

Solution 

Let 

A = ({#}, [a. &]. [Zq, a, b], 5, q, Zq, 0) 
where 5 is defined by the following rules: 

8(q, a , Z 0 ) = {(< 7 , aZ 0 )} 6 , Zq) = {(<?, £Z 0 )} 

<5(g, a, a) = {(< 7 , a#)} <5(g, b, b) = {(//. M>)} 

5(ry. a , i) = {(<?, A)} <5(r/, 6 , a) = {(r/. A)} 

5(9, A, Z 0 ) - {(, q , A)} 

The construction of 5 is similar to that of the pda given in Example 7.2. 
But here we want to match the number of occurrences of a and b\ so, the 
construction is simpler. We start by storing a symbol of the input string and 
continue storing until the other symbol occurs. If the topmost symbol in PDS 
is a and the current input symbol is b . a in PDS is erased. If w has equal 
number of a s and ZPs, then (q. vt\ Z 0 ) ( q , A, Zq) |—( q , A, A). So 

w e A(A). We can show that N(A) is the given set of strings over { a , using 
the construction of <5. 

73 PUSHDOWN AUTOMATA AND CONTEXT-FREE 
LANGUAGES 

In this section we prove that the sets accepted by pda (by null store or final 
state) are precisely the context-free languages. 

Theorem 7.3 If L is a context-free language, then we can construct a pda 
A accepting L by empty store, i.e. L = N(A). 
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Proof We construct A by making use of productions in G. 

Step 1 (Construction of A) Let L = 1(G), where G = (L v , X, P, 5) is a 
context-free grammar. We construct a pda A as 

A = ((g), X, V v u X, &, 5, 0) 

where 5 is defined by the following rules: 

R { : S(q, A. A) = {(< 7 , a) | A —» a is in P} 

R 2 : 8(q . a, a) = {(< 7 , A)} for every a in X 

We can explain the construction in the following way: The pushdown 
symbols in A are variables and terminals. If the pda reads a variable A on the 
top of PDS, it makes a A-move by placing the R.H.S. of any A-production (after 
erasing A). If the pda reads a terminal a on PDS and if it matches with the 
current input symbol then the pda erases a . In other cases the pda halts. 

If w e L(G) is obtained by a leftmost derivation 

S =$ U]A]CCi ^ ^ vv, 

then A can empty the PDS on application of input string >v. The first move 
of A is by a A-move corresponding to S —» zqAjCq. The pda erases S and 
stores u x A\(Xi. Then using P 2 * the pda erases the symbols in zq by processing 
a prefix of w. Now, the topmost symbol in PDS is A\. Once again by applying 
the A-move corresponding to Aj —> zz 2 A 2 Gf 2 , the pda erases A 2 and stores 
z/ 2 A 2 a 2 above Proceeding in this way. the pda empties the PDS by 
processing the entire string tv. 

Before proving that L(G) = N(A) (step 2), we apply the construction to 
an example. 


EXAMPLE 7.7 


Construct a pda A equivalent to the following context-free grammar: S 
QBE, B —» OS | IS jO. Test whether 010 4 is in A r (A). 

Solution 

Define pda A as follows: 

A = ({q}, {0. 1}. {S, B. 0. 1 }. 5. q. S, 0) 

8 is defined by the following rules: 

S(q. A. S) = {( 4 , OBB)} 

* 2 = <5(g. A. B) = {(q. OS). (q. (OSj, (q. 0)} 

Ry- 8{q. 0, 0) = {(, q. A)} 1S 
R 4 : 8(q. 1 , 1 ) = {(< 7 . A)} 
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Thus, 


(q, 010 4 , 5) 
f— (<?, 010 4 . 0 BB) 

by Rule R[ 

\— (cj , 10 4 , BB) 

by Rule P 3 

1— far, 10 4 , ISB) 

by Rule R 2 since (, q , 15) e a(< 7 , A, 5) 

1- (9. «}) 

by Rule P 4 

1— fa, 0 4 , OBflB) 

by Rule R { 

|— fa, 0 3 , B0B) 

by Rule P 3 

p- fa, 0 3 , 000 ) 

by Rule P 2 since (< 7 , 0 ) e a(q , A, 5) 

P- (9, A, A) 

by Rule P 3 


010 4 c N(A) 


Note: After entering ( q , 10 4 . BB), the pda may halt for a different sequence 
of moves, for example, ( q , 10 4 , BB) |— (< 7 , 10 4 , 05) |— ( q , 10 4 , 00). As 
S(q, 1 , 0 ) is the empty set, the pda halts. 


Let us continue with the proof of the theorem. 

Step 2 (Proof of the construction, i.e. L(G) = N(A )). First we prove L(G) 
c N(A). Let w e L(G ). Then it can be derived by a leftmost derivation. Any 
sentential form in a leftmost derivation is of the form juAa , where u e X*, 
A e V N and a e (V v u X)*. We prove the following auxiliary result: If 
S => uAa by a leftmost derivation, then 

(< 7 , uv, 5) p- (< 7 , v, Aa) for every v e X* (7.22) 

We prove (7.22) by induction on the number of steps in the derivation of 

uAa . If 5 => «A, then w = A, a = A, and 5 = A. As (< 7 , v, 5) p~ (< 7 , v, 5), there 
is basis for induction. 

Suppose 5 ==> wAa by a leftmost derivation. This derivation can be split 
as S => zqA^i => wAa. If the A r production we apply in the last step is 
Ai —^ w 2 A££ 2 , then u == tqw 2 , ct — ct 2 cq. 

As S =>u l A l a l . by induction hypothesis, 


( q , MiM2 v , 5) (< q , z/ 2 v > AiOfi) (7.23) 

As Ai —> H : Aa 2 is a production in P, by Rule P* we get (< 7 , A, Aj) |— 
(g. A, w 2 Aa 2 ). Applying Results 1 and 2 in Section 7.1, we get 

(1 q , i/ov, A^p |— (<7, u 2 v. w 2 Aa 2 a]) 


|— (< 7 , v, Aa 2 tXj) by Rule R 2 


Hence, 


(q, uyv, Aibfj) p- (<7, v, Aa 2 «t) 


( 7 . 24 ) 
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But UiU 2 = m and = a. So from (7.23) and (7.24), we have 

(g, mv, 5) p- (r/, v, A a) 


Thus (7.22) is true for 5 wAa. By the principle of induction, (7.22) is true 
for any derivation. Now we prove that L(G) c N(A). Let w e L(G). Then, 
w can be obtained from a leftmost derivation, 


From (7.22), 


S => uAv => uii'v = w 
(g, mm'v, 5) p- (g, m'v, Av ) 


As A —» u is in P, 


By Rule R 2 * 
Therefore, 

Next we prove 


( q , m'v, Av) p- (g, m'v, m'v) 

(#, m'v, m'v) p- (g, A, A) 
w = mm'v e A(A) proving L(G) c N(A) 
N(A) c L(G) 


Before proving the inclusion, let us prove the following auxiliary result: 

S ua if (q , mv, 5) p- (#, v, a) (7.25) 

We prove (7.25) by the number of moves in (q, mv, 5) p~ (g, v, a). 

n 0 

If (< 7 , mv, 5) p- (#, v, a), then u = A, 5 = a; obviously, 5 => Aa Thus 
there is basis for induction. 

Let us assume (7.25) when the number of moves is n. Assume 


0 1 , mv, S) p (q, v, a) (7.26) 

Tine last move in (7.26) is obtained either from (q, A, A) |— (q* A, a') or 

from (< 7 , a, a) |— (<q , A, A). In the first case, (7.26) can be split as 

(q, mv, 5) p- (q< v, Aa 2 ) |— ( q , v, c^o^) = (#, v, a) 

By induction hypothesis, S => uAa 2 , and the last move is induced by 
A —» cp Thus, 5 => implies Ofj or 2 = tz So, 

5 mA<x 2 => ua } a 2 = Ma 

In the second case, (7.26) can be split as 

(q , mv, 5) p- (g, mv, ad) j — (#, v, a) 

Also, u = m'm for some u e Z. So, (ry, m'mv, 5) p- (g, mv, aa) implies (by 

induction hypothesis) S =>. m'm a = zza. Thus in both the cases we have shown 
that S ==> wa By the principle of induction, (7.25) is true. 
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Now, we can prove that if w g N(A) then w e L(G). As w e N(A ), 
we have (g, w, 5) (<?, A, A). By taking u - w, v = A, a = A and applying 

(7.25), we get S wA = h\ i.e. w G L(G). Thus, 

L(G) = N(A ) 

Theorem 7.4 If A = (Q, X. T, <5, q& Z 0 , F) is a pda, then there exists a 
context-free grammar G such that L(G) = N(A), 

Proof We first give the construction of G and then prove that N(A) = L(G). 
Step 1 (Construction of G). We define G = (V N , 2, P, 5), where 

v* = {5} u z, 9 'i 1r/ e e, z g n 

i.e. any element of V, v is either the new symbol 5 acting as the start symbol 
for G or an ordered triple whose first and third elements are states and the 
second element is a pushdown symbol. 

The productions in P are induced by moves of pda as follows: 

F { : 5-productions are given by 5 —> [ q o, Z 0 . q] for every q in Q . 

R< Each move erasing a pushdown symbol given by (q\ A) e 8{q , a, Z) 
induces the production [ q , Z, </] —» a. 

Each move not erasing a pushdown symbol given by (q h Z(Z 2 ... Z m ) 
g 8(q , < 7 , Z) induces many productions of the form 

[q. Z. r/] -> Z h q 2 ][q 2 . Z 2 , 93 ] ... [<?,„, Z m , 4 '] 

where each of the states ^ • • •« q m can be any state in (7. Each move yields 
many productions because of Ry We apply this construction to an example 
before proving that L{G ) = N(A). 


EXAMPLE 7.8 


Construct a context-free grammar G which accepts A(A), where 
A = ({<? 0 . <7 iK {«• b}, {Z 0 . Z}, 5. go. Zo, 0) 
and 5 is given by 

<5(<?o, b, Zo) = {(<7o> zz o)} 

5(<7 0 , A, Z 0 ) = {(^o> A)} 

% 0 , ft, Z) = {(*>, ZZ)} 

<5(<? 0 . Z) = {(<?i, Z)} 
b, Z)= {(^i, A)} 
a, Zo) = {(<? 0 , Z 0 )} 

Solution 

Let 

G = (V v . {a. &}, P. 5) 
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where V*- consists of S, [q& Zq, q 0 ], [ q 0 , Zq, q x \ [q 0 . Z, <? 0 ], Z, ^], 
[<5fi, Z 0 , %], [<r/ b Z 0 . q { ], [q h Z, <? 0 ], [<? b Z. qrj]. 

The productions are 

Pj : 5 —> [<?o Zq. <7 0 ] 

P 2 : S h/o Zq, <7 t ] 

5(<y 0 * b , Z 0 ) = {(<&• ZZq)} yields 

F ? : [<y 0 , Z 0 , » b[<7o> Z. <ry 0 ][<7o* Z 0 , <7ol 

P 4 : [< 7 o, Z 0 , q 0 ] Z?[c/ 0 * Z, ry 3 ][g b Z 0 , <7ol 

P 5 : [<7 0 , Z 0 . <7J —> &[<7o Z, ry 0 ][(7 0 , Z 0 , <7 j] 

^6 : [<7o* Zq. q\] —> &[<7o> Z, # 3 ][< 7 b Zq. < 7 3 ] 

5(V /0 . A, Zq) = {G?o> A)} gives 

P ?• [#o* Z 0 . <7 0 ] A 
<5riyo b. Z) = {(<7 0 . ZZ)} gives 

/V [<7o z, < 7 oJ —> *[<7o> Z. £/o][<7o> Z, ^ 0 ] 

P 9 : [r/ 0 . Z. r/oJ -» b[q 0 , Z, qi j [<7 b Z, <7 q] 

Prv [<7o- Z. <7i] —» *[9o- Z. <7o][^o* Z* <7il 

Fi b [? 0 . Z. q{\ —» b[q& Z. ][^i ? Z, ry 3 ] 

<5(g 0 . a. Z) = {(Vy b Z)} yields 

^32 : [< 7 o? Z. ry 0 ] -> a[c7 b Z <y 0 ] 

P 13 : [g 0 . Z. < 71 ] —> ap/i, Z, ry 3 ] 

<5riy b b, Z) = {(<y b A)} gives 

P u : [<7i- Z, b 

S(q u a . Zq) = {(ry 0 . Zq)} gives 

F i5 : [<y b Zq, cy 0 ] “> r/[^ 0 , Zq, </o] 

P\t>' [c/ b Zq- tfi] ™> tffeo, Zo, ry 3 ] 

P 3 —Pi6 give the productions in P. 

Using the techniques given in Chapter 6. we can reduce the number of 
variables and productions. 

Step 2 Proof of the construction, i.e. N(A) - L{G). 

Before proving that N(A) = L(G), we note that a variable [q. Z, q \ indicates 
that for the pda the current state is q and the topmost symbol in PDS is Z In 
the course of a derivation, a state q is chosen in such a way that the PDS is 
emptied ultimately. This corresponds to applying R 2 . (Note that the production 
given by R 2 replaces a variable by a terminal.) 
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To prove N(A) = L(G X we need an auxiliary result, i.e. 

[q , 4 ^ => w (7.27) 

if and only if 

(c h w, Z) (<?', A, A) (7.28) 

We prove the ‘if part by induction on the number of steps in (7.28). If 
(< 7 , w, Z) |— (q\ A, A), then w is either a in £ or a in A. So, we have 

(q. A) e <5(f/. iv, Z) 

By R 2 we get a production [« q , Z, g'] —» w. So, [#, Z, #'] => w. Thus there 
is basis for induction. 

Let us assume the result, namely that (7.28) implies (7.27) when the 
former has less than k moves. Consider ( q , w, Z) |Z- (q\ A, A). This can be 
split as 

(q, aw', Z)\-(q it w\ Z,Z 2 . . . ZJ j^ 1 (q\ A, A) (7.29) 

where w = aW and a e S or a - A, depending on the first move. 

Consider the second part of (7.29). This means that the PDS has ZjZ 2 . . . 
Z m , initially, and on application of h\ the PDS is emptied. Each move of pda 
can either erase the topmost symbol on the PDS or replace the topmost symbol 
by some non-empty string. So several moves may be required for getting on the 
top of PDS. Let wj be the prefix of w such that the PDS has Z 2 Z 3 . .. Z m after 
the application of w { . We can note that the string Z 2 Z 3 . . . Z m is not disturbed 
while applying Wj. Lei w; be the substring of w such that the PDS has 
Z i+] . . . Z m on application of Z M .. . Z m is not disturbed while applying 
Wj, vv 2 , . . w t . Tlie changes in PDS are illustrated in Fig. 7.3. 



. Empty 
store 


Fig. 7.3 Illustration of changes in pushdown store. 
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In terms of IDs, we have 

(q h w h Zj) p- (q i+u A. A) for i = 1, 2, .. m, q m+l = q (7.30) 

As each move in (7.30) requires less than k steps, by induction hypothesis we 
have 

[c/p Z h g /+1 ] Wj for i ~ 1, 2, . . m (7.31) 

G 

The first part of (7.29) is given by (q h Z{Z 2 . . . Z m ) e S(q , a, Z). By /? 3 
we get a production 

[<?. Z, c/| =t> rt[<7h Z|, qJ\q 2 , Z 2 , q 2 ] ■■ ■ [q m , Z m , q] (7.32) 
From (7.31) and (7.32), we get 

1q , Z, <?'] ==> a>V[H ’2 ., . w m = w 

By the principle of induction, (7.28) implies (7.27). 

We prove the 'only if part by induction on the number of steps in the 
derivation of (7.27). Suppose [ q , Z, q\ => w. Then [< q , Z, p] w is a 
production in R. This production is obtained by R 2 . So >v - A or we X and 
(q\ A) € S(q< w, Z). This gives the move (q, w, Z) j— (,q , A, A). Thus there 
is basis for induction. 

Assume the result for derivations where the number of steps is less than k . 
Consider [</, Z, g'] w. This can be split as 

[q. Z. q\ => fl [ 9l , Z,. q 2 ][q 2 , Z 2 , c/ 3 ] . . . [q, n , Z m , q] % w (7.33) 

As G is context-free, we can write w = aw { w 2 . . . w nv where 

Ulh Zn Qi+\] => and q m+{ = 

G' 

By induction hypothesis, we have 

(tf,-, Wp Zf) \±- (q i+ j. A, A): for / = 1, 2, . . ., m (7.34) 

By applying Results 1 and 2, we get 

ZjZ{ + \ . , . Z m ) p— (<//+{. A, Zj + ^ . . . Z m ) 

™i w M. • • * Z; ... ZJ p- (r/ /+l , w /+1 . . . w„p Z /+1 . . . Z m ) (7.35) 
By combining the moves given by (7.35), we get 

(q h wpv : . . . Z, ... Z m ) p- (c/, A, A) (7.36) 

The first step in (7.33) is induced by (q\, Z{Z 2 . . . Z,„) e <5(g, a, Z). The 
corresponding move is 

a, Z) j— 0/u A, Z^Zb . . . Z m ) 

By applying the Result 1, we get 

(q, awi . . . w m , Z) |— w { . . . w m , Z { ... Z m ) 


(7.37) 
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From (7.37) and (7.36). we get (q, h\ Z) p- (q'\ A } A) . By the principle 
of induction. (7.27) implies (7.28). 

Thus we have proved the auxiliary result. In particular. 

[< 70 , Zq. q { ] => w iff ( q 0 . w, Z 0 ) p- (q\ A. A) (7.38) 
Now w e L{G) 
iff S A w 

iff S =$ [q 0 , Zq. q] => w (for some q by R^) 

iff (c/q. vt\ Zq) p- (c/. A. A) by the auxiliary result 

iff iv e A'(A) 

Thus. iV(A) - L(G). I 

Corollary If A is a pda. then there exists a context-free grammar G such that 
T(A) = L(G). 

Proof By Theorem 7.2 we can find a pda A' such that T(A) = N(A ')• By 
Theorem 7.4 we can construct G such that N(A') = L(G). Tints 7(A) = L(G). I 


EXAMPLE 7.9 


Construct a pda accepting {a n b m a n \nu n > 1} by null store. Construct the 
corresponding context-free grammar accepting the same set. 

Solution 

The pda A accepting [d l b m d l | m, n > 1} is defined as follows: 

A = (k/ 0 - <7i}, {a, fc}, {a, Z 0 }, 5. </ 0 . Z 0 . 0) 
where <5 is defined by 

RT 5(r/ 0 , a, Zo) - {(< 7 0 , oZq)} 

R 2 ; c5C<7q. a, a) ~ {(q& aa)} 

R ? : S(q Q , h. a) = {(q u a)} 

R 4 : <5(1/1. b> a) - {(< q u a)} 

R 5 : 5(9i. a. a) = {(<?!, A)} 

R 6 : 5(9!. A. Zo) = {( 91 , A)} 

This is a modification of <5 given in Example 7.2. 

We start storing d s until a /? occurs (Rules Ri and R 2 ). When the current 
input symbol is b . the state changes, but no change in PDS occurs (Rule R 3 ). 
Once all the b 's in the input string are exhausted (using Rule R 4 ). the 
remaining <Ts are erased (Rule R>). Using R^, Z 0 is erased. So, 

(<7o a ,l b m d\ Z 0 ) p- ( 91 , A. Zo) |—(9u A, A) 

This means that d l b m d l e A(A). We can show' that 
A'(A) = {rf'iVIm. n > 1} 


by using Rules Ri-R 6 . 
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Define G = (V N , {<7, b }, P, S), where V N consists of 

[<7o> Zq, <7ol [<7b Z 0 . < 7 0 ], [q () , a, < 7 0 ], [q h a, q 0 ] 

[<7o- Zb, <7i], [<7i, Zb, </i]. [<7o- <?iL [?b «- <7il 

The productions in P are constructed as follows: 

The 5-productions are 

P|! 5 too. Zb, P< S ^ too* Z 0 , #j] 

<?too. Zb) = {(<7o* aZ o)l Educes 

P 3 : too Zq, ^7 0 ] a too. a, <7oltoo z o> <7ol 

P 4 ; too. Zq, 9o] rt too> 4lltob z o* 9 0 ] 

P 5 : too Z 0 , r/J -» atoo» ^ 9o]too z o- 9il 

^6 : too. z o* dll ^too. a * Hilton z o* <7il 

5too- cl a) - {to 0 , aa)} yields 

P 7 : too. «* <7ol -> ^too* <7o]to/o> a. ?o] 

P 8 : too* a, <? 0 ] “* ^too. ^* <7iltob «. 4ol 

P 9 : too. a - <7il —> rt too? <7oltoo. a * <7i 1 

P 10 : too- ^ (7il -» «too. ^ ?i]toi. «> ?il 

<5 too. b, a ) - tou «)} g ives 

P u : too a, r/ 0 ] -> fctoi. <7ol 

P 12 : too* ^ <7i] -> %b fl. dll 

S(q h b . a) = (ton a )l yields 

P 13 : toi* a, q 0 ] -> b[q h a , ry 0 ] 

P i4 : ton <*. <7il -> ^tob «. d\] 

S(q i- a. a) = {toi* A)} gives 

Pi5 : toi. a » <7i] “> a 

S(q\ , A. Zb) = {toi. A)} yields 

^Kv toi. Zq' <?il ~= > A 

'Vote: When the number of states is a large number, it is neither necessary 
nor advisable to write all the productions. We construct productions involving 
those variables appearing in some sentential form. Using the constructions in 
Chapter 6, we can simplify the grammar further. 

Theorem 7.5 The intersection of a context-free language L and a regular 
language R is a context-free language. 
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Proof Let L be accepted by a pda A - ( Q A , Z, T, <5 4 > <7o F 4 ) by final 

state and /? by DFA M = (Gm, I, <5 V , F 0 , F M ). 

We define a pda M' accepting L n R by final state in such a way that M' 
simulates moves of A on input a in £ and changes the state of M using S M . On 
input A, M' simulates A without changing the state of M. Let 

M' = ( Qm x Q a , Z, r, 5, [p 0 , <7o]> Zq- Fm x ^ 4 ) 
where 8 is defined as follows: 

<5([p, q]< a , X) contains ([P' s </], 7 ) when 4/(/X a) = p and S A (cp a , X) 
contains (<r/\ 7 ). <5([p, 9 ], A, X) contains ([/>, <?'], 7 ) when 8 A (cp A. X) 
contains (cf, 7 ). 

To prove T(M') = L n P we need an auxiliary result, i.e. 

(IPo- %l w > z o) 1-^ (Ip, <?]> A , y) (7.39) 

if and only if 

(<7o> w > 2b) lx A, y) and <5 w (p 0 , w) = p (7.40) 

We prove the ‘only if part by induction on i (the number of steps). If 
i = 0, the proof is trivial (In this case, p = po, q = vv = A and 7 = Zq). Thus 
there is basis for induction. Let us assume that (7.39) implies (7.40) when the 
former has i - 1 steps. 

Let (Oo, q 0 l Zo) \ip (lP> ql A, 7 ). This can be split into ([p 0 , q 0 l 
W a, Zq) fLj- ([p\ q\ a, 0) I— ([p. q], A, 7 ). where w = w'a and a is in Z 
or a = A depending on the last move. By induction hypothesis, w'e have 
(< 7 o, w\ Zq) (g\ A, j 8 ) and 5^', v/) = //. By definition of 5, ([//, r/j, 
^ P) ([p, ql A. 7 ) implies ( 4 , a, p) (tf, A, 7 ) and 4/(//, 0 ) = p. 
(iVate; p - p when <7 = A.) So, w'a) = a) = p. By combining 

the moves of A, we get (q 0 , w'a, Zq) (q\ a, p) ( q , A, 7 ), i.e. (</ 0 > w, 

Zq) jL- ( 4 , A, 7 ). So the result is true for i steps. 

By the principle of induction the ‘only if part is proved. 

We prove the ‘if part also by induction on i. It is trivial to see that there 
is basis for induction. 

Let us assume (7.40) with i - 1 steps. Assume that (<? 0 , w, Zq) f-L (q. A, 7 ) 
and 5vXPo> h?') = p. Writing w as w'a and taking 5^(p 0 ' w ') as p\ we E et 

(<7o, W'a, Zo) lx (<T «- Z 3 ) hr <*» A, /)• So, (q 0 , W, Z 0 ) |£ A/', A, p). 

By induction hypothesis, we get ([t?q, < 7 ol- w', Zq) j -^ 1 ([//, </']. A, fi). 
Also, 5 ^( 77 '. a) = p and (< 7 ', a, /J) |-j- (< 7 , A, y) implies ([//, < 7 '], a, P) Yjj 
(Ip, </], A, y). 
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Combining the moves, we get ([/? 0 . ^ 0 ], \\\ Z 0 ) ([ p , <y], A, 7 ). 

Thus the result is true for / steps. By the principle of induction, the ‘if’ 
part is proved. 

Note: In Chapter 8 we will prove that the intersection of two context-free 
languages need not be context-free (Property 3. Section 8.3). 


7A PARSING AND PUSHDOWN AUTOMATA 

In a natural language, parsing is the process of splitting a sentence into words. 
There are two types of parsing, namely the top-down parsing and the bottom- 
up parsing. Suppose we want to parse the sentence “Ram ate a mango.” If NP, 
VP, N, V, ART denote noun predicate, verb predicate, noun, verb and article, 
then the top-down parsing can be done as follows: 

5 -> NPVP 
—» Name VP 
—> Ram V NP 
—» Ram ate ART N 
—> Ram ate a N 
—» Ram ate a mango 

The bottom-up parsing for the same sentence is 

Ram ate a mango —> Name ate a mango 
—> Name verb a mango 
-> Name V ART N 
NP VN P 
-4 NP VP 
5 

In the case of formal languages. w 7 e derive a terminal string in L(G) by 
applying the productions of G. If we know' that w e Z* in L(G), then 5 => w. 
The process of the reconstruction of the derivation of \v is called parsing. 
Parsing is possible in the case of some context-free languages. 

Parsing becomes important in the case of programming languages. If a 
statement in a programming language is given, only the derivation of the 
statement can give the meaning of the statement. (This is termed semantics.) 

As mentioned earlier, there are two types of parsing: top-down parsing and 
bottom-up parsing. 

In top-down parsing, we attempt to construct the derivation (or the 
corresponding parse tree) of the input string, starting from the root (wdth label 
S') and ending in the given input string. This is equivalent to finding a leftmost 
derivation. On the other hand, in bottom-up parsing we build the derivation 
from the given input string to the top (root with label 5). 
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7.4.1 Top-down Parsing 

In this section we present certain techniques for top-down parsing which can 
be applied to a certain subclass of context-free languages. We illustrate them 
by means of some examples. We discuss LL(1) parsing, LL(A) parsing, left 
factoring and the technique to remove left recursion. 


EXAMPLE 7.10 


Let G = ({5, A, B} r (a, b }, P, S) where P consists of S ciAB , S —> bBA, 
A — > bS, A —> a. B —> as, B b. w ~ cibbbab is in L(G) . Let us tty to 
get a leftmost derivation of w. When we start with S we have two choices: 
S -> aAB and 5 —» bBA, By looking at the first symbol of w, we see that 
5 b£A will not yield >v. So we choose S aAP as the production to be 
applied in step 1 and we get S => aAB . Now consider the leftmost variable 
A in the sentential form aAB . We have to apply an A-production among the 
productions A —> and A —> a, A -> a will not yield w subsequently since 
the second symbol in >v is b. So, we choose A —> bS and get S aAB => 
abSB. Also, the substring ab of vt j is a substring of the sentential form abSB . 
By looking ahead for one symbol, namely the symbol b, we decide to apply 
S £PA in the third step. This leads to S => aAP => =» abbBAB, The 

leftmost variable in the sentential form abbBAB is B. By looking ahead for 
one symbol which is b, we apply the P-production B —» b in the fourth step. 
On similar considerations, we apply A —> a and B -» b in the last two steps 
to get the leftmost derivation. 

S aAB => abSB => abbBAB => abbbAB => abbbaB =$ abbbab 

Thus in the case of the given grammar, we are able to construct a leftmost 
derivation of w by looking ahead for one symbol in the input string. In order 
to do top-down parsing for a general string in L(G). we prepare a table called 
the parsing table. The table provides the production to be applied for a given 
variable with a particular look ahead for one symbol. 

For convenience, we denote the productions S —> aAB , S bBA , A —» 

bS, A —» a, B —» aS and B —> b by P\ % P 2 . P& Let E denote an error. 

It indicates that the given input string is not in L(G). The table for the given 
grammar is given in Table 7.1. 

TABLE 7.1 Parsing Table for Example 7.10 

A a b 

S E p 2 

A E P A P 3 

B _ E _Ps_ Pe_ 

For example, if A is the leftmost variable in a sentential form and the first 
symbol in unprocessed substring of the given input string is b, then we have to 
apply Py 
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A grammar possessing this property (by looking ahead for one symbol in 
the input string we can decide the production to be applied in the next step) 
is called an LL(l) grammar. 


EXAMPLE 7.11 


Let G be a context-free grammar having the productions 5 —» F + 5, 5 —> 

F * 5, 5 —> F and F a. Consider w = a + a * a. This is a string in L(G). 

Let us try to get the top-down parsing for w. 

Looking ahead for one symbol will not help us. For the string a + a * a , 
we can apply F —> a on seeing cl But if a is followed by + or *, we cannot 
apply a . So in this case it is necessary to look ahead for two symbols. 

When we start with 5 we have three productions S F + S, 5 —> F * 5 
and 5 —> F. The first two symbols in a + a. * a are a +. This forces us to 
apply only 5 —» F + 5 and not other 5-productions. So, 5 F + 5. We can 
apply F —> a now to get 5 => F + 5 => a + 5. Now the remaining part of 
h’ is a * a . The first two symbols a * suggest that we apply 5 -» F * 5 in 
the third step. So, 5 => a + 5 =» a + F * 5. As the third symbol in w is a , 

we apply F —> a, yielding 5 a + F * 5 => a + <rf * 5. The remaining part 

of the input string \v is a So, we have to apply S —> F and F —> a. Thus the 
leftmost derivation ofa + fl*ais5=>F + 5=>a + 5=>fl + F* 
5 => a + a * 5 => a + a * F =» a + a *a. 

As in Example 7.10, we can prepare a table (Table 7.2) which enables us 
to get a leftmost derivation for any input string. P u P 2 , P 3 and P 4 denote the 
productions 5 -4 F + 5, 5 -4 F * 5, 5 —» F and F —> a. E denotes an error. 


TABLE 7.2 Parsing Table for Example 7.11 



A 

a 

+ 

* 

aa 

a+ 

a* 

s 

E 

P 3 

E 

E 

E 

Pi 

Pi 

F 

E 

P4 

E 

E 

E 

P4 

Pa 


+a 

++ 

+* 

*a 

*+ 

** 


S 

E 

E 

E 

E 

E 

E 


F 

E 

E 

E 

E 

E 

E 



For example, if the leftmost variable in a sentential form is F and the next 
two symbols to be processed are a *, then we apply P 4? i.e. F —> a. When we 
encounter * a as the next two symbols, an error is indicated in the table and 
so the input string is not in L(G). 

A grammar G having the property (by looking ahead for k symbols we 
derive a given input string in L(G)), is called an LL(&) grammar. The grammar 
given in Example 7.11 is an LL(2) grammar. 
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In Examples 7.10 and 7.11 for getting a leftmost derivation, one 
production among several choices was obtained by look ahead for k symbols. 
This kind of nondeterminism cannot be resolved in some grammars even by 
looking ahead. 

This is the case when a grammar has two A-productions of the form A —? 
a/3 and A —> ay By a technique called ‘left factoring’, we resolve this 
nondeterminism. Another troublesome phenomenon in a context-free grammar 
which creates a problem is called left recursion. A variable A is called left 
recursive if there is an A-production of the form A —> A a. Such a production 
can cause a top-down parser into an infinite loop. Left factoring and technique 
for avoiding left recursion are provided in Theorems 7.6 and 7.7. 

Theorem 7.6 Let G be a context-free grammar having two A-productions of 
the form A -> a/3 and A -» ay If A -> a/3 and A -» ay are replaced by 
A —> a A'. A' 1 3 and A' —> y where A' is a new variable then the resulting 
grammar is equivalent to G . 

Proof The equivalence can be proved by showing that the effect of applying 
A —> a/3 and A —> ay in a derivation can be realised by applying A —> aA\ 
A' —> /3 and A' —» y and vice versa. 

Note: The technique of avoiding nondeterminism using Theorem 7.6 is 
called left factoring. 

Theorem 7.7 Let G be a context-free grammar. Let the set of all 
A-productions be {A —> A a u .... A -» Aa iv A /3j. .... A —> j3 m }. Then 
the grammar G' obtained by introducing a new r variable A 7 and replacing all 
A-productions in G by A P\A\ .. .. A —> fi nt A\ A' —» a\A\ ..A' —> a it A' 
and A' —» A is equivalent to G. 

Proof Similar to proof of Lemma 6.3. 

Theorems 7.6 and 7.7 are useful to construct a top-down parser only for 
certain context-free grammars and not for all context-free grammars. We 
summarize our discussion as follows: 

Construction of Top-Down Parser 

Step 1 Eliminate left recursion in G by repeatedly applying Theorem 7.7 to 
all left recursive variables. 

Step 2 Apply Theorem 7.6 to get left factoring wherever necessary. 

Step 3 If the resulting grammar is LL(£) for some natural number k , apply 
top-down parsing using the techniques explained in Examples 7.10 and 7.11. 


EXAMPLE 7.12 


Consider the language consisting of all arithmetic expressions involving +, *, 
( and ) over the variables xl and x2. This language is generated by a grammar 
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G = ({T, F, F} Z, F, F). where I = {a\ L 2, +, *, (,)} and P consists of 

F -a F + T F -> (F) 

F -> T F —» xl 

F —» F * F F —> x2 

F —» F 

Let us construct a top-down parser for L(G). 

Step 1 We eliminate left recursion by applying Theorem 7.7 to the left 
recursive variables F and 71 We replace F —> F + F and F —> T by F —» TE\ 
E' —» + FF' and £' A (F' is a new variable). Similarly, F —> F * F and 
F —» F are replaced by F —> FF'. F' * FT and T —> A. The resulting 
equivalent grammar is 

G) - ({F. F, F F\ £'}, I, P'. F). where P' consists of 
F FF' F A 

F'-> + FF' F —> (F) 

£'-> A F —> aT 

F -a FT F a2 

F'-> * FF' 

Step 2 We apply Theorem 7.6 for left factoring to F —» xl and F x2 to 
get new productions F —» ;cA r —> A —> 1 and N 2. 

The resulting equivalent grammar is 

G : = ({F. F. F. F', F 7 }. I. P". E) where P" consists of 


P,: E 

-4 TE' 

P 6 ■ 

T 

-4 A 

ha 

i j 

hi 

-4 + ££' 

Pi : 

F 

-4 (£) 

Py E' 

-4 A 

A : 

F 

-4 xiV 

Py T 

-4 FT' 

: 

N 

-4 1 

Py T" 

-4 * FT' 

Pio: 

N 

-4 2 


Step 3 The grammar G^ obtained in step 2 is an LL(1) grammar. The 
parsing table is given in Table 7.3. 


TABLE 7.3 Parsing Table for Example 7.12 



A 

X 

1 

2 

+ 

* 

( 

) 

E 

E 

P ! 

E 

E 

E 

E 

Pi 

E 

T 

E 

F 

E 

E 

E 

E 

P 4 

E 

h 

E 

P 8 

E 

E 

E 

E 

P? 

E 

V 

Pq 

E 

E 

E 

E 

Po 

E 

P 6 

E' 

P 3 

E 

E 

E 

P 2 

E 

E 

P 3 

N 

E 

E 

Pa 

P 10 . 

E 

E 

E 

E 
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7.4.2 Top-down Parsing Using Deterministic pda’s 

We have seen that pda's are the accepting devices for context-free languages. 
Theorem 7.3 gives us a method of constructing a pda accepting a given context- 
free language by empty store. In certain cases the construction can be modified 
in such a way that a leftmost derivation of a given input string can be obtained 
while testing to know whether the given string is accepted by the pda. This 
is the case when the given grammar is LL(1). We illustrate this by 
constructing a (deterministic) pda accepting the language given in 
Example 7.10 and a leftmost derivation of a given input string using the pda. 


EXAMPLE 7.13 


For the grammar given in Example 7.10, construct a deterministic pda 
accepting L(G ) and a leftmost derivaiton of abbab. 

Solution 

We construct a pda accepting L(G )S ($ is a symbol indicating the end of the 
input string). This is done by using Theorem 7.3. The transitions are 

S(q. A, A) = {(<?, a) | A —> a is in P} 

S(q. t, t) = {{q. A)} for every t in £ 

This pda is not deterministic as we have two 5-productions, two 
A-productions. etc. In Example 7.10 we resolved the nondeterminism by 
looking ahead for one more symbol in the input string to be processed. In the 
construction of pda this can be achieved by changing the state from q to q a 
on reading a. When the pda is in state q a and the current symbol is S we 
choose the transition resulting in ( q, aAB). Now the deterministic pda 
accepting L(G )S by null store is 

A = ({p, q, q u , q h }. {a, b. $}, {5, A. B. a, b. Z 0 }, <5, p, Zq, 0) 
where 8 is defined by the following rules: 

Ri ■ 8{p , A, Zq) = (q. s) 

R 2 ■ 8(q, a. A) = (q u , A) 

R 3 : 8(q a , A. a) = (q, e) 

Ra ■ 8(q , b. A) = (c lh . A) 

R 5 : S(q a . A. b ) = (q. e ) 

R 6 : 8(q a , A, 5) = (q a . aAB ) 

R-i : 8(q h . A. S) = (q h . bBA ) 

/? g : 8{q a , A. A) = {q a , a) 

R 9 : 8(q h , A, A) = (q h , bS) 
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*10 : A, B) = to,, aS) 

*n : 8(q h . A, B)= to, Z>) 

*12 : <$to $• Zq) = to A) 

Here R { changes the initial ID (p, w, Z) into (#, >v, SZ). i? 2 an< i ^4 are f° r 
remembering the next symbol. R(-R\\ are simulating the productions. R 3 and R 5 
are for matching the current input symbol and the topmost symbol on PDS and 
for erasing it (in PDS), Finally, R l2 is a move for erasing Z and making the 
PDS empty when the last symbol $ of the input string is read. 

To get a leftmost derivation for an input string w, apply the unique 
transition given by to R j 2 . When we apply /? 6 to Ru, we are using a 

corresponding production. By recording these productions we can test whether 
w e L(G) and get a leftmost derivation. The parsing for the input string 
abbbab is given m Table 7.4. 

The last column of Table 7.4 gives us a leftmost derivation of abbbab. It 
is S => ciAB => abSB => abbBAB => abbbAB => abbbaB => abbbab . 


TABLE 7.4 Top-down Parsing for w of Example 7.13 


Step State Unread input Pushdown stack Transition 

used 


1 

P 

abbbab$ 

Zo 

— 

2 

d 

abbbab $ 

SZ Q 


3 

da 

bbbab$ 

sz 0 

r 2 

4 

da 

bbbabS 

aABZ 0 

r 6 

5 

d 

bbbab$ 

ABZq 

r 3 

6 

db 

bbab§ 

ABZq 

Ra 

7 

do 

bbab$ 

bSBZ 0 

Rq 

8 

d 

bbabS 

SBZ 0 

Rs 

9 

db 

bab$ 

SBZ 0 

Ra 

10 

db 

bab$ 

bBABZo 

R- 

11 ’ 

d 

bab$ 

BABZ 0 

Rs 

12 

db 

ab$ 

BABZq 

Ra 

13 

db 

aPS 

bABZ 0 

to 

14 

d 

aPS 

ABZq 

Rs 

15 

da 

PS 

ABZq 

r 2 

16 

da 

PS 

sBZq 

Rs 

17 

d 

PS 

bz q 

r 3 

18 

db 

S 

BZq 

Ra 

19 

db 

$ 

bZ 0 

Ru 

iS* J 

d 

$ 

Zo 

Rs 

21 

d 

A 

A 

R\2 


Production 

applied 


S -> aAB 

A -> PS 


S bBA 


B b 


A a 


B b 
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7.43 Bottom-up Parsing 

In bottom-up parsing we build the derivation tree from the given input string 
to the top, (the root with label S). For certain classes of grammars, called weak 
precedence grammars, we can construct a deterministic pda which acts as a 
bottom-up parser. We illustrate the method by constructing the parser for the 
grammar given in Example 7.12. 

In bottom-up parsing we have to reverse the productions to get S finally. 
This suggests the following moves for a pda acting as bottom-up parser. 

(i) S(p. A, or) = {(P, A) |there exists a production A a) 

(ii) S(p, a. A) - {(p. <T)} for all a in I. 

Using (i) we replace a 1 on the basis by A when A —» a is a production. 
The input symbol c is moved onto the stack using (ii). For acceptability, we 
require some moves when the PDS has S or Z 0 on the top. 

As in top-down parsing we construct the pda accepting L(G)$. Here we 
will have two types of operations, namely shifting and reducing. By shifting 
we mean pushing the input symbol onto the stack (moves given by (ii)). By 
reducing we mean replacing a 1 by A when A —> a is a production in G 
(moves given by (i)). 

At every step we have (i) to decide whether to shift or to reduce (ii) to 
choose the prefix of the string on PDS for reducing, once we have decided to 
reduce. For (i) we use a relation P called a precedence relation. If ( a , b) e P 
where a is the topmost symbol on PDS and b is the input symbol then we 
reduce. Otherwise we shift b onto the stack. Regarding (ii), we choose the 
longest prefix of the string on the PDS of the form or to be reduced to A ( when 
A —> a is a production). 

We illustrate the method using the grammar given in Example 7.12. 


EXAMPLE 7.14 


Construct a bottom-up parser for the language L(G)$. where G is the grammar 
given in Example 7.12. 

Here the productions are E —» E + T\ E 7\ T —> T * F, T —» F, 
F —> (£), F —> x 1 and F —> x2. Using these productions, we can construct 
the precedence relation P. It is given in Table 7.5. If (a, b) is in P 7 then we 
have a tick mark ; v ? in the (a. b ) cell of the table. Using Table 7.5 we can 
decide the moves. For example, if the stack symbol is F and the next input 
symbol is *, then we apply reduction. If the stack symbol is £, then any input 
symbol is pushed onto the stack. 



Chapter 7: Pushdown Automata U 259 


TABLE 7.5 

The Precedence Relation for Example 7.14. 


Stack symbol/ v 

Input symbol 

( .) 1 2 + 

S 


Zo 


A 


( 





) 

V 

■/ 

X 

X 

1 

/ 

/ 

X 

/ 

2 

/ 

/ 

/ 

X 

+ 





E 





T 

✓' 

■/ 


X 

F 

✓ 


✓ 

X 


Using the precedence relation and the moves given at the beginning of this 
section we can construct a deterministic pda. As in the construction of top-down 
parser, when we look ahead for one symbol we 'remember’ it by changing state. 
The deterministic pda which acts as a bottom-up parser is 

a = ca r. r <5, P . z G . 0 ) 

where 

r = I u {$}. Q = {p} u {p a : a E n 

r= t£, f, f} u i u {zb, 5} 

and <5 is given by the following rules: 

R\ : S(p. cr. A) = (p. A) 

F 2 : <5(Pcr< A, a) = (p, a a) for all (a . a) E P 

: <5(p a . A. T + F) = (p^ F) w ? hen (F, o) e P 

F 4 : <5(p a , A, Fa) = (p^, Fa) when (F, a) 6 P and a e F - {+} 

Ra : <50?^. A. F* F) = (p a . F) when (F, o) e P 

Re • Slpcr, A. Fa) = (p<y, Fa) when (F. a) e P and a e T - {*} 

Ft : <5(p a . A, FQ = (p a . F) when ( ). a) e F 

F* : <5(p a? A. L\) = (p a , F) when (1, a) e P 

Re, : S (p a . A, 2r) = (p^, F) when (2. a) € F 

F 10 : 5(p s , A, F) = (p s? A) 

Fn : 5(p s . A. Zq) = (p s . A) 

As A is deterministic, we have dropped parentheses on the R.H.S. of 
R\ - F n . Using the pda A. we can get a bottom-up parsing for any input string 
vv. The bottom-up parsing for aT + (xl) is given in Table 7.6. 
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Table 7,6 Bottom-up Parsing for x\ + (x2) 

Step 

State 

Unread 

input 

Pushdown 

stack 

Rule 

used 

Production 

applied 

1 

P 

vl + (v2) 

Z 0 

— 


2 

Px 

1 + (,v2) 

Zo 

R; 


3 

P 

1 + (.v2) 

a-Z q 

r 2 


4 

Pi 

+ (a-2) 

A-Z 0 

Ri 


5 

P 

+ (.v2) 

1 xZq 

r 2 


6 

P+ 

+ <a2) 

1a-Z 0 

Ri 


7 

P+ 

+ (a'2) 

FZ 0 

Rs 

F -» , v1 

8 

P + 

+ (a'2) 

TZ 0 

Re 

T -> F 

9 

P + 

+ (a-2) 

EZ C 

r 4 

E-^T 

10 

P 

(a-2) 

+EZ 0 

Ri 


11 

P( 

a-2) 

+EZ 0 

Ri 


12 

P 

a-2) 

(+ez 0 

r 2 


13 

Px 

2) 

(+EZo 

Ri 


14 

p 

2) 

a-(+HZ 0 

Ri 


15 

Pi 

) 

a(+£Z 0 

Ri 


16 

p 

) 

2a(+£Z 0 

r 2 


17 

P) 

$ 

2a(+£Z 0 

Ri 


18 

P) 

$ 

F(+EZ 0 

Rs 

F a-2 

19 

P) 

$ 

T(+EZ q 

Re 

T -> F 

20 

P) 

$ 

£( + £Z 0 

Ra 

E ^ T 

21 

p 

S 

)E(+EZ 0 

R 2 


22 

Ps 

A 

)E( + EZ 0 

Ri 


23 

Ps 

A 

F+EZq 

R? 

F -» (E) 

24 

Ps 

A 

T+EZq 

Re 

T -> F 

25 

Ps 

A 

EZ 0 

Ro 

E E+ T 

26 

Ps 

A 

Zo 

Rio 


27 

Ps 

A 

A 

Rn 



By backtracking the productions that we applied, we get a rightmost 
derivation E=3>E+T=>E + F=>E + (£)=>£+ (7) E + (F) => £ + (x2) 
T + (x2) => F + (x2) => A'l + (x2). 

In Chapter 8 we will discuss how LR(£) grammars are amenable for 
parsing. 

7.5 SUPPLEMENTARY EXAMPLES 


EXAMPLE 7.15 


Construct a pda accepting all palindromes over {a. b }. 

Solution 

Let L ~ {w e { a , fc}* |w = w 7 }. Before constructing the required pda, note 
that L consists of palindromes of odd or even length. If w in L is of odd 
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length* there is a middle symbol which need not be compared with any other 
symbol. If w in L is of even length, the rightmost symbol of the first half is 
the same as the leftmost symbol of the second half. The key idea of the 
construction is to store the symbols in the first half (or the symbols lying to 
the left of the middle of the input string) and matching them with the symbols 
in the second half (or the symbols to right of the middle of the input string). 
If there is no matching, the machine halts. 

We define the pda M as follows: 

M = ({q Q , cji, < 72 }, {a, b}, {a, b, Zq}, 8, q 0 , Zq, {<? 2 }) 
where <5 is defined by 


<S(< 7 o, a, Zq) = 

{07o> 

aZi), 

(9b 

: Zo)} 

(7.41) 

S(q 0 , b , Zq) = 

{(<7o. 

bZo), 

(9b 

■ Zq)} 

(7.42) 

8(q 0 , a. a) = 

{(<7o> 

aa), 

(9b 

a)} 

(7.43) 

<5(<7q, b. a) = 

{(4o- 

ba ), 

(9b 

a)} 

(7.44) 

Oo 

§ 

II 

{(9o> 

ab). 

(9b 

b)} 

(7.45) 

8(q 0 , b, b ) = 

{(9o. 

bb). 

(9b 

b)} 

(7.46) 

S(q Q , A. Z 0 ) = 

{(9i- 

m 



(7.47) 

<5(<?o, A, a) = 

{(9b 

«)} 



(7.48) 

8(q 0 , A. b)~ 

{(9b 

b)} 



(7.49) 

8{q x , a, a) = 

{(9i- 

A)} 



(7.50) 

8(q u b, b) = 

{(9b 

A)} 



(7.51) 

8(q u A. Z 0 ) = 

{(92- 

Zq)} 



(7.52) 


Obviously, M is a nondeterministic pda. (7.41)-(7.46) give us two 
choices. The first choice can be used for storing the input symbol without 
changing the state. (7.47)-(7.49) are used for indicating that the first half of 
the input string is over; there is change of state in this case. The second choice 
of (7.41)-(7.46) is used when w is a palindrome of odd length and the middle 
symbol of w is reached; in this case, we ignore the middle symbol and make 
no change in PDS. (7.50) and (7.51) are used to match symbols of the input 
string (second half) and cancel the symbol in PDS w ; hen they match. (7.52) 
is used to move to the final state q 2 after reaching the bottom of PDS. 

We can prove that L is accepted by M by final state. The reader can take 
an odd palindrome and an even palindrome and check that the final state q 2 
is reached. 


EXAMPLE 7.16 


Construct a deterministic pda accepting L = {w e {a, Z?}* | the number of a's 
in w equals the number of b 's in \v} by final state. 



262 id Theory of Computer Science 


Solution 

We define a pda M as follows: 

M = ({q 0i q x \. {a, b}, {a. b, Z 0 }, <S. q 0 , Z 0? {^}) 


where 5 is defined by 

5(r/ 0 , a, Zq) = {(q u Z 0 )j (7-53) 

<5(r/ 0 , Zq) = {((j 0? Z?Z 0 )} (7.54) 

5(<Zo a, fe)= {(tf 0 . A)} (7.55) 

5(q 0 , b , £) = {(r/ 0 . to)} (7.56) 

5(ry b a, Z 0 ) = {(<?}. aZ 0 )} (7.57) 

5(9!. b. Z 0 )= {(q 0 , aZ 0 )} (7.58) 

5(r/i, a, a) = {(r/ 1? aa)} (7.59) 

<5(g 1? a) = {(g i? A)} (7.60) 


The construction can be explained as follows: 

If the pda M is in the final state q h it means it has seen more a 's than 
b's. On seeing the first a , M changes state (from q 0 to q { ) ((7.53)). Afterwards 
it stores the as in PDS without changing state ((7.57) and (7.59)). It stores 
the initial b in PDS ((7.54)) and also the subsequent ms ((7.56)). The pda 
cancels a in the input string, with the first (topmost) b in PDS ((7.55)). If all 
b's are matched with stored a' s, and M sees the bottom of PDS, M moves 
from c[\ to r/ 0 ((7.58)). The Vs in the input string are cancelled on seeing a 
in the PDS (7.60)). 

M is deterministic since S is not defined for input A. The reader is advised 
to check that q { is reached on seeing an input string \v in L. 


EXAMPLE 7.17 


Construct a pda M accepting L = {a l h l c k | i =j or j = A'} by final state. 

Solution 

We define pda M as follows: 


({<70 <?!• • 


{ 

a, b, 

c}, (Zq, 

X}, 

5, r/ 0 , Zq, 

{<7b ft}) 

given by 








<5(<7o 

A, 

Z 0 ) = 

{(<7i- 

Zq). (</2 

■ Zq), 

(#3? 2 q)} 

(7.61) 


c , 

Zq) = 

{(<71 ■ 

. Zq)} 



(7.62) 

8(q y 

a . 

Zq) = 

{(ft, 

, XZ 0 )} 



(7.63) 

5{q z . 

, a, 

X) = 

{(<72 

. XX)} 



(7.64) 

8(q z . 

} b. 

X) = 

(<74, 

b. X) = 

{(<74, 

A)} 

(7.65) 

<$to- 

A, 

z 0 ) = 

{to, 

, Zq)} 



(7.66) 
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5(q } . a. Z 0 )= {(< 73 . Zq)} (7.67) 

S(q 3 , b, Zq) = {(q s . XZq)} (7.68) 

< 5 (<r/ 5 . b. X)= {(< 75 , AT)} (7.69) 

S(q 5 . c. X) = <5(4,. c. X) = {(< 76 , A)} (7.70) 

<5(96: A. Z n ) = {(< 73 . A)} (7.71) 

The states q { , q 2 . qy stand for cPb a c k , a'b'c k , a'b'c 1 respectively where 
i > 0, j > 0. k > 0. (7.61) indicates the initial guess. The three choices 
correspond to the three cases. 

(< 7 <, c\ Zq) = (<7o, Ac*. Zq) j— (q x . c k . Zq) |— (q b A, Zq) 
by (7.61) and (7.62). As q\ is a final state. c k e T(M). 

The pda in state q 2 will not change state and stores a's in the input string 
as X‘s in PDS ((7.63) and (7.64)). On seeing the first b after many a's. M 
changes its state to q± and cancels X in PDS for subsequent b's ((7.64)). If 
it reaches the bottom of PDS. M goes back to q 3 . which is an accepting state 
((7.76)). So M accepts db\ It continuous to be in state q L on seeing c's 
subsequently ((7.62)). So, M accepts db l d\ 

For dealing with dVcK M makes the initial guess using (7.61) and reaches 
state q 2 . It simply reads a's without changing state or PDS ((7.67)). 
M subsequently replaces b with X and changes to state q 5 . Afterwards M 
goes on changing V s to X's ((7.69)). On seeing a c, M changes state. 
Subsequent ds are matched with X's (which correspond to b's read earlier) 
and X's in PDS are cancelled. On reaching the bottom of PDS, M reaches c/ 3 , 
a final state ((7.71)). 

Thus. a°b°c k . dlxc k , dVd e T(M) for / > 0, j > 0. k > 0. Hence. 
T(M) = L 


EXAMPLE 7.18 


Convert the grammar S aSb | A. A —> bSa | S | A to a pda that accepts the 
same language by empty stack. 

Solution 

We construct a pda A as 

A = ({q}. {a . b }. {5. A. a, b}. <5. q, S. 0) 
where 8 is defined by the following rules 

S(q< A, S) = {(q. aSb), (q> A)} 

8(q. A, A) = {(q 9 bSA ), (#. S). (<?. A)} 

<5(ry. a. a) - {(<?. A)} 

<56?, ft, fe) = {rtf A)} 
and A is the required pda. 
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EXAMPLE 7.19 


If A is a pda, then show that there is a one-state pda A, such that 

N(A) = N(A X ). 

Solution 

By Theorem 7.4, we get a context-free grammar G such that L(G) = AT(A). 
Denote G by (K v . I, P, 5). By Theorem 7.3, we can get a one-state pda A } 
given by 

Aj = ({ 9 }, Z, V* u I, 5, q, S, 0) 
such that iV(Aj) = L(G). 

SELF-TEST 

Choose the correct answer to Questions 1-6. 

1. If 5(q, a h Z i) contains (q\ /?), then 

(a) ( 9 , ^ 1 ^ 2 ’ ZT) |— (cj . ch, /?Z 2 ) 

(b) ( q , ZjZ 2 ) j— (9 ? /^Z 2 ) 

(c) ( 9 , Z 2 ) |— (q\ ci\, Zj) 

(d) ( 9 , Z L Z 2 ) |— ( 9 '. a 2; ZjZ 2 ) 

2 . In a deterministic pda, | S(q, a. Z) | is 

(a) equal to 1 

(b) less than or equal to 1 

(c) greater than 1 

(d) greater than or equal to 1 

3. In a deterministic pda: 

(a) S(q, a. Z) = 0 => 6 ( 9 , A, Z) ^ 0 

(b) m Z) 0 => <5(ry. A, Z) = 0 

(c) 5 ( 9 , A, Z) ^ 0 => 5 ( 9 , a, Z) ^0 

(d) 5(9, A, Z) 0 =» 5 ( 9 . a, Z) =0 

4. > 1} is accepted by a pda 

(a) by null store and also by final state. 

(b) by null store but not by final state. 

(c) by final state but not by null store. 

(d) by none of these. 

5. {a u b Zn \n > 1} is accepted by 

(a) a finite automaton 

(b) a nondeterministic finite automaton 

(c) a pda 

(d) none of these. 



Chapter 7: Pushdown Automata B 265 


6. The intersection of a context-free language and a regular language is 

(a) context-free 

(b) regular but not context-free 

(c) neither context-free nor regular. 

(d) both regular and context-free. 

Fill up the blanks: 

7. In bottom-up parsing, we build the deviation from _ to 


8 . In LR(1) grammar, we can decide the production to be applied in the 

next step by , . 

9. w e 7(A), where A is a pda if (q 0 , w, Zq) p- _ 

10. iv e A-(A), where A is a pda if (go, w, Zq) p~ . . 

EXERCISES 

7.1 If an initial ID of the pda A in Example 7.2 is (q 0 > aacaa . Z 0 ), what 
is the ID after the processing of aacaat If the input string is (i) abcba , 
(ii) abcb, (iii) acba , (iv) abac . (v) abab , will A process the entire 
string? If so, what will be the final ED? 

7.2 What is the ID that the pda A given in Example 7.5 reaches after 
processing (i) <rZr. (ii) a 2 b\ (iii) <r\ (iv) Ir. (v) b 3 a 2 , (vi) ababab if 
A starts with the initial ID? 

7.3 Construct a pda accepting by empty store each of the following 
languages. 

(a) " {a n b fn a"\m t n > 1} 

(b) {a”b 2u \n > 1 } 

(c) {flW|w, n > 1 } 

(d) {a m b n \m > n > 1 } 

7.4 Constmct a pda accepting by final state each of the languages given in 
Exercise 7.3. 

7.5 Construct a context-free grammar generating each of the following 
languages, and hence a pda accepting each of them by empty store. 

(a) [a n b n | n > 1 } u {a m b 2m \rn > 1 } 

(b) {a n b m a u \m. n > 1} u {a H c u \n > 1 } 

(c) {a n b m c m d n \nn n > 1 } 

7.6 Let L - {a m b n \ n < m}. Construct (i) a context-free grammar accepting 
L, (ii) a pda accepting L by empty store, and (iii) a pda accepting L 
by final state. 
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7.7 Do Exercise 7.6 by taking L to be the set of all strings over {a. b] 
consisting of twice as many a s as //s. 

7.8 Construct a pda accepting the set of all even-length palindromes over 
{ a . b } by empty store. 

7.9 Show that the set of all strings over { a , b} consisting of equal number 
of a's and b> s is accepted by a deterministic pda. 

7.10 Apply the construction given in Theorem 7.4 to the pda M given in 
Example 7.1 to get a context-free grammar G accepting N{M). 

7.11 Apply the construction given in Theorem 7.4 to the pda obtained by 
solving Exercise 7.4. 

7.12 Show that {ci l b n | n > 1} u {a m b 2m \ m > 1} cannot be accepted by a 
deterministic pda. 

7.13 Show that a regular set accepted by a deterministic finite automaton 
with n states is accepted to final state by a deterministic pda with n 
states and one pushdown symbol. Deduce that every regular set is a 
deterministic context-free language. 

(A context-free language is deteiministic if it is accepted by a determi¬ 
nistic pda.) 

7.14 Show that every regular set accepted by a finite automaton with n states 
is accepted by a deterministic pda with one state and n pushdown 
symbols. 

7.15 If L is accepted by a deterministic pda A. then show that L is accepted 
by deterministic pda A which never adds more than one symbol at a 
time (i.e. if 8(q. a , z) = (q\ /). then | y\ < 2). 

7.16 If L is accepted by a deterministic pda A, then show that L is accepted 
by a deterministic pda A which always (i) removes the topmost symbol 
or (ii) does not change the topmost symbol, or (in) pushes a single 
symbol above the topmost symbol 
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In this chapter we study LR{k) grammars (a subclass of context-free grammars) 
which play an important role in the study of programming languages and the 
design of compilers. For example, a typical programming language such as 
ALGOL has LR(I) parser. 

8.1 LR(k) GRAMMARS 

In Chapters 4 and 6 we were mainly interested in generating strings using 
productions and in performing the membership test. In the design of 
programming languages and compilers, it is essential to develop the parsing 
techniques, i.e. techniques for obtaining the ‘reverse derivation' of a given 
string in a context-free language. In other words, we require techniques to find 
a derivation tree for a given sentence w in a context-free language. 

To find a derivation tree for a given sentence vti we can start with w and 
replace a substring, say W) of w, by a variable A if A w\ is a production. We 
repeat the process until we get S. But this is more easily said than done, for at 
every stage there may be several choices and we have to choose one among 
them. If we make a wrong choice, we will not get 5, and in this case we have 
to backtrack and try some other substring. However, for a certain subclass of 
context-free grammars, it is possible to carry out the process, i.e. getting the 
derivation in the reverse order for a given string w in a deterministic way. LR(&) 
grammars form one such subclass. Here, LR(&) stands for left-to-right scan of 
the input string producing a rightmost derivation using the k symbol look¬ 
ahead on the input string. 

Before discussing the LR(£) grammars, we should note that although 
parsing gives only the syntactical structure of a string, it is the first step in 
understanding the ‘meaning' of the sentence. 
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Consider some sentential form afiw of a context-free grammar G, where 
a, (3 e (V 7 , v u Z)* and w e Z*. Suppose we are interested in finding the 
production applied in the last step of the derivation for aj3w. If A —> is a 
production, it is likely that A —> /3 is the production applied in the last step, 
but we cannot definitely say that this is the case. If it is possible to assert that 
A —> /3 is the production applied in the last step by looking ahead for k 
symbols (i.e. k symbols to the right of /3 in afiw ), then G is called an LR(L) 
grammar. The production A —> (3 is called a handle production and f3 is called 
a handle. 

We write a => ft if /3 is derived from a by a right-most derivation. Before 

R 

giving the rigorous definition of an LR (k) grammar, let us consider a grammar 
for which parsing is possible by looking ahead for one symbol. 


EXAMPLE 8.1 


Let G be S —> AB , A —» aAb, A —» A, B -» Bb, B —> b. It is easy to see 
that L(G) = {a m b"\ n > in > 1}. Some sentential forms of G obtained by 
right-most derivations are AB , ABb k . ci m Ab m b k , a m b m+k , where & > 1. AB 
appears as the R.H.S. of S AB. So AB may be a handle for AB or ABb k . 
If we apply the handle to AB, we get S ^ AB. If we apply the handle to ABb k , 

we get Sb k => ABb k . But Sb k is not a sentential form. So to decide whether 
AB can be a handle, we have to scan the symbol to the right of AB. If it is 
A, then AB serves as a handle. If the next symbol is b, AB cannot be a handle. 
So only by looking ahead for one symbol we are able to decide whether AB 
is a handle. Let us consider crV. As we scan from left to right, we see that 
the handle production A -4 A may be applied. A can serve as a handle only 
when it is taken between the rightmost a and the leftmost b. In this case w'e 
get crAb~ => crb~\ and we are able to decide that A A is a handle 

~ R 

production only by looking ahead of one symbol (to the right of A). If A is 
taken between two ci s. we get aAcib' => crb\ But aAab J is not a sentential 

R 

form. Similarly, we can see that the correct handle production can be 
determined by looking ahead of one symbol for various sentential forms. 

A rigorous definition of an LR(A') grammar is now given. 

Definition 8.1 Let G = (V 7 v . I. P, S ) be a context-free grammar in which 
S => S only when n - 0. G is an LR (k) grammar (k > 0) if 

(i) S => aAw => a/3\\\ where a, f3 e Vy- w g ZA 

r r 

(ii) S => a'A'w' => a'B'w'. where a', B' e V*. v/ e Z*, and 

R R 

(iii) the first \a/3\ + k symbols of a/3vv and cc'fi'w' coincide. Then a - a', 
A = A', fi = P'. 

Remarks 1. If a[3\v or a/fi'w' have less than |a/3| + k symbols, we add 
some 'blank symbols', say $. on the right and compare. 
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2. It is easy to see how we can get the derivation tree for a given 
terminal string. For getting the derivation tree, we want to get the derivation 
“in the reverse order”. Suppose a sentential form afiw is encountered. We can 
get a right-most derivation of fiw in the following way: If A -> fi is a 
production, then we have to decide whether A fi is used in the last step of 
a right-most derivative of afiw. On seeing k symbols beyond fi in afiw, we 
are able to decide that A —> fi is the required production in the first step. For, 
if a'fi'w' is another sentential form satisfying condition (hi), then we can 
apply A f —r fi' in the last step of a right-most derivation of a'fi'w'. But by 
definition it follows that A = A', fi = fi' and a = a\ So A fi is the only 
possible production we can apply and we are able to decide this after 'seeing’ 
the k symbols beyond fi. We repeat the process until we get S. 

3. If G is an LR(A) grammar, it is an LR(A') grammar for all k' > k. 


EXAMPLE 8.2 


Let G be the grammar S —> a A, A —> Abb } b. Show that G is an LR(O) 
grammar. 

Solution 

It is easy to see that any element in L(G) is of the form ab“ tl+L . The sentential 
forms of G. are ciA, a.Ab 2j \ ab~ n+l . Let us find out the last production applied 
in the derivation of cib~ m \ As aA, Abb , b are the possible right-hand sides 
of productions, only A —» b can be the last production; w T e are able to decide 
this without looking at any symbol to the right of b. Similarly, the last 
productions for aAb 2n and aA are A —> Abb and S —» aA, respectively. (We 
are able to say that A —» Abb is the last production for any sentential form 
aAlr n for ail n > 1.) Thus, G is an LR(Q) grammar. 


EXAMPLE 8.3 


Consider the grammar G given in Example 8.1. Show that G is an LR(1) 
grammar, but not an LR(0) grammar. Also, find the derivation tree for a 2 b A . 

Solution 

In Example 8.1 we have shown that for sentential forms of G we can determine 
the last step of a right-most derivation by looking ahead of one symbol. So G 
is LR(1). We have also seen that S —> AB is a handle production for the 
sentential form AB, but not for ABb k . In other words, the handle production 
cannot be determined without looking ahead. So G is not LR(0). 

To get the derivation tree for a 2 b 4 . we scan a 2 b 4 from left to right. After 
scanning a. we look ahead. If the next symbol is a, we continue to scan. If 
the next symbol is b . we decide that A —> A is the required handle production. 
Thus the last step of the right-most derivation of orb 4 is 

ccAb 4 => a 2 Kb 4 

R 
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To get the last step of crAb 4 , we scan crAb 4 from left to right. aAb is a 
possible handle. We are able to decide that this is the right handle without 
looking ahead and so we get 

aAbb 2 => crAb 4 

R 

Once again using the handle aAb, we obtain 

Ab 1 => aAbb 2 

R 

To get the last step of the rightmost derivation of Ab 2 , we scan Ab 2 . A possible 
handle production is B —> b. We also note that this handle production can be 
applied to the first b we encounter, but not to the last b. So, we get 
ABb => Ab 2 . 

R 

For ABb, a possible ^-handle is Bb. Hence, we get AB ABb. Finally, 
we obtain S ^>A£. Thus we have the following derivations: 

a 2 Ab 4 => a 2 Ah 4 by looking ahead of one symbol 

R 

aAbb 2 => crAb 4 by not looking ahead of any symbol 

R 

Ab 2 => aAbb 2 by not looking ahead of any symbol 

R 

ABb Ab 2 by not looking ahead of any symbol 

AB ^ ABb by not looking ahead of any symbol 

S ^ AB by looking ahead of one symbol 

The derivation tree for crb 4 is as shown in Fig. 8.1. 


s 



Fig. 8.1 Derivation tree for a 2 b 4 


8.2 PROPERTIES OF LR (k) GRAMMARS 

In this section we give some important properties of LR (k) grammars which 
are useful for parsing and other applications. 
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Recall the definition of an ambiguous grammar. A grammar G is 
ambiguous if there exists w e L(G) which has two derivation trees. The next 
theorem gives the relation between LR(k) grammars and unambiguous 
grammars. 

Property 1 Every LR(T) grammar G is unambiguous. 

Proof We have to show that for any x e L*, there exists a unique right-most 
derivation. Suppose we have two rightmost derivations for x, namely 

S =e> aAw z=^aBw = x (8.1) 

* R 

S => of Aw' => a'b'w' = x (8.2) 

R R 

As afiw = a'fi'w from the definition it follows that a = a', A = A' and 
fi = fi'. As afiw = a'fi'w', we get w = w\ and so aAw = aTV. Hence the 
last step in the derivations (8.1) and (8.2) is the same. Repeating the arguments 
for the other sentential forms derived in the course of (8.1) and (8.2), we can 
show that (8.1) is the same as (8.2). Therefore. G is unambiguous. I 

We have seen that the deterministic and the nondeterministic finite 
automata behave in the same way in so far as acceptability of languages is 
concerned. Tire same is the case with Turing machines. But the behaviour of 
deterministic and nondeterministic pushdown automata is different. In 
Chapter 7 we have proved “that any pushdown automaton accepts a context- 
free language and for any context-free language L , we can construct a 
pushdown automaton accepting L. The following property gives the relation 
between LR(£) grammars and pushdown automata. 

Property 2 If G is an LR(£) grammar, there exists a deterministic pushdown 
automaton A accepting L(G). 

Property 3 If A is a deterministic pushdown automaton A, there exists an 
LR(1) grammar G such that L(G) = N(A). 

Property 4 If G is an LR(£) grammar, where k > 1, then there exists an 
equivalent grammar Gi which is LR(1). In so far as languages are concerned, 
it is enough to study the languages generated by LR(0) grammars and LR(1) 
grammars. 

Definition 8.2 A context-free language is said to be deterministic if it is 
accepted by a deterministic pushdown automaton. 

Property 5 The class of deterministic languages is a proper subclass of the 
class of context-free languages. 

The class of deterministic languages can be denoted by ^dcfl- 

Property 6 -T dcfl is closed under complementation but not under union and 
intersection. 

The following definition is useful in characterizing the languages accepted 
by an LR(O) grammar. 
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Definition 8.3 A context-free language has prefix property if no proper 
prefix of strings of L belongs to L. 

Property 7 A context-free language is generated by an LR{0) grammar if 
and only if it is accepted by a deterministic pushdown automaton and has 
prefix property. 

Property 8 There is an algorithm to decide whether a given context-free 
grammar is LR (k) for a given natural number k. 


8.3 CLOSURE PROPERTIES OF LANGUAGES 

We discussed closure properties under union, concatenation, and so on in 
Chapter 4. In this section we will discuss closure properties under intersection, 
complementation, etc. Recall that 4i are the families of type 0 

languages, context-sensitive languages, context-free languages and regular 
languages, respectively. 

Property 1 Each of the classes 1 , 4^4 is closed under union, 

concatenation, closure and transpose operations (Theorems 4.5^4.7). 

Property 2 i s closed under intersection and complementation (Theorems 
5.7 and 5.8). 

Property 3 / cfl is not closed under intersection and complementation. 

We establish property 3 by a counter-example. We have already seen that 
L\ - {a n b n c‘ | n > 1. / > 0} and L 2 - {crb n c u j n > 1, j > 0} are context-free 
languages (Examples 4.8 and 4.9). L\ n L 2 = { cfb n c n | n > 1}. In Example 6.18, 
we have shown that {a n b n c n | n > 1} is not context-free. Thus, =4 fl is not 
closed under intersection. 

Using DeMorgan's law 7 , we can w-rite L { n L 2 = (L[ u L£) r . We have 
proved in Chapter 4 that i cfl is closed under union. If x cfl w 7 ere closed under 
complementation, then L 2 n L 2 turns out to be context-free w ; hich is not true. 
Hence, i cfl is not closed under complementation. 

8.4 SUPPLEMENTARY EXAMPLES 


EXAMPLE 8.4 


Show ? that the grammar 5 —» a Ac, A —> Abb \ b is an LR(0) grammar. 

Solution 

It is easy to see that L(G) = {ab 2n+x c | n > 0}. The sentential forms of G are 
aAc, aAb 2u c, ab“ lt+l c. We consider the last production applied in the derivation 
of cib“ n+l c. As aAc ., Abb and b are the possible right-hand sides of productions, 
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only A b can be the last production in the rightmost derivation of cib 2n+l c. 
(We do not have aAc and Abb as substrings of ab 2n+] c). Similarly the last 
productions for aAc and aAb Zn c are S aAc and A Abb respectively. 
Hence G is an LR(0) grammar. 


EXAMPLE 8.5 


Show that S —> aAb, A —> cAc \ c is not LR(/r) for any natural number k. 

Solution 

It is easy to see that 

1(G) - {ac 2nM h j 77 > 0} 

Consider acccb e L(G). The last production is A o. But we can apply this 
handle only by knowing the entire string. This can be applied to the middle c 
but this is known only after looking at two symbols beyond the c which 
replaces A. Continuing this argument, we can decide the handle of ac 2nVi b by 
only looking at n + 1 symbols beyond the c which replaces A. So it is not 
LRik) for any k. 


EXAMPLE 8.6 


Give an example of a language which can be generated by an LR(k) grammar 
for some k and also by a grammar that is not LR(£) for any k. 

Solution 

Consider {ac 2n * l b | n > 0}. This is generated by the grammar S —> aAJo y 
A —> cAc j c which is not LR (k) for any A:. 

This language can also be generated by the grammar S -4 aAb , 
A Acc | c. This is LR(0). (This grammar is similar to the grammar in 
Example 8.4.) 


SELFTEST 

Choose the correct answer to Questions 1-5: 

1. An LR(£) grammar has to be 

(a) a type 0 grammar 

(b) a type 1 grammar 

(c) a type 2 grammar 

(d) none of these. 

An LR (k) grammar is 

(a) always unambiguous 

(b) always ambiguous 

(c) need not be unambiguous 

(d) none of these. 
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3. A handle is 

(a) a string of variables and terminals 

(b) a string of variables 
(e) a string of terminals 
(d) a production. 

4. The automaton corresponding to an LR (k) grammar is 

(a) a deterministic finite automaton 

(b) a nondeterministic finite automaton 

(c) a deterministic PDA 

(d) a nondeterministic PDA. 

5. £ dcf] is closed under 

(a) union 

(b) complementation 

(c) intersection 

(d) none of these. 

EXERCISES 

8.1 Show that the grammar S —> ciAb , A —> aAb \ a is an LR(1) or is it an 
LR(0)? 

8.2 Show- that the grammar S 0A2, A —» 141, A 1 is not an LR(0). 

8.3 Is S AB , S —> aA, A —> aA, A a, B —> a an LR(&) for some kl 

8.4 Show that {a m b m c n j m, n> 1} u {a m b n c n | m, n> 1} cannot be generated 
by an LR(£) grammar for any k. 

8.5 Are the following statements true? (a) If G is unambiguous, it is LR (k) 
for some k . (b) If G is unambiguous, it is LR (k) for every k. Justify your 
answer. 

8.6 Is S -> C | A C aC | b, D -> aD \ an LR(0)? 

8.7 For a production A —> /3 of a context-free grammar G and w in 

($ is a symbol not in V ;V u S), define R^w) to be the set of all strings 

of the form a(3w such that A —> /3 is a handle for afixvw' for some w' 

in X* $* and 5$ ==> ccAww' afiww'. (In other words, a string afiw is 

in R k (w) if we get a penultimate step of a rightmost derivation of afiww' 
for some w'.) Show that R^w) is a regular set. 

[Hint: Define G'= (L'y, V N u Z, P\ S'), where 

V N = {[A, w] | A € L v , vv e and |w| = k} 

S' - [S , $ k ] 
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Also show that each production in P induces a production in P / in the 
following manner: 

(a) If A —> x is in P, where a e I*, then [A, w] —> xw is included in 
P\ 

(b) If A —> X\X^ • • • X m is in P, Xy 6 V^ f Xj + \ . . . X m w w w for 
some w', w" in X*$* with \w'\ = k, then [A, w] —> ft ... 

[. Bj, \v'] is included in P' 

As the productions in P / have either a terminal string or a terminal string 
followed by a variable on R.H.S., G f can be reduced to an equivalent 
regular grammar. Use the principle of induction to show that 

[S* $ k ] % [A. w] if and only if for some w'S$ k =b> aAwnv 

g R 

This will establish KG') = ft(w). 

8.8 Prove that a context-free grammar G is an LR(P) if and only if the 

following holds: A string y in R k {w) corresponding to a production 
A x P\ is a substring of some element 8 in RjJ(w') corresponding to a 
production A 2 fa implies y = 8, A { = A 2? fa = ft. 

[7/m?: The proof follows from the definition of LR(Ari grammars.] 

8.9 Prove Property 2 of Section 8.2. 

Solution We give an outline of the construction of the required dpda 
A. A accepts a string w if S$ k remains in the stack after the processing 
of u\ For this purpose. A has to simulate the reverse derivation of w. 
This is achieved by finding suitable handles. R k (wys are defined 
precisely for this purpose (refer to Exercise 8.7). As R k (wY s are regular, 
there exist deterministic finite automata M k (w) corresponding to R k (\v). 

For our deterministic pda A to contain the information regarding the 
finite automata, the pushdown store of A is required to have an additional 
track. In the first track, symbols from V N u X are written or erased. In the 
additional track, the information regarding Mfawy s is stored in the form 
of maps. The map N a gives the states of finite automata M k (w) after the 
processing of a string a for all productions in G and strings w in X*$* 
of length k. The existence of a suitable handle is indicated by a final state 
of M k (w) (in the second track). 

We can describe the way A acts as follows: A is capable of reading 
k + 1 symbols on PDS, where l is the length of the longest R.H.S. of 
productions of G . (This can be achieved by modifying the finite control.) 
A reads the top m symbols on track 1 for some m < k + /. The second track 
is suitably manipulated. For example, if X] ... X m is in track 1, then 
track 2 stores the maps N X \N X \x 2 • • • ftvi ... xnv If a suitable handle is 
found, then a sentential form is obtained by replacing the R.H.S. of the 
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handle of its L.H.S. in the given string on track 1. As G is LR(£), this 
can be done in almost one way. 

The above process is repeated until it is no longer possible. If m 
symbols are not sufficient to carry out the process, more symbols are read 
and placed on the stack (track 1). 

If stack 1 has S$ k at a particular stage, A accepts the corresponding 
string. A is the required dpda accepting L(G). 




Turing Machines 
and Linear Bounded 
Automata 


In the early 1930s. mathematicians were trying to define effective computation. 
Alan Turing in 1936, Alanzo Church in 1933, S.C. Kleene in 1935, Schonfinkel 
in 1965 gave various models using the concept of Turing machines, A-calculus, 
combinatory logic, post-systems and //-recursive functions. It is interesting to 
note that these were formulated much before the electro-mechanical/electronic 
computers were devised. Although these formalisms, describing effective 
computations, are dissimilar, they turn to be equivalent. 

Among these formalisms, the Turing’s formulation is accepted as a model 
of algorithm or computation. The Church-Turing thesis states that any 
algorithmic procedure that can be carried out by human beings/computer can be 
carried out by a Turing machine. It has been universally accepted by computer 
scientists that the Turing machine provides an ideal theoretical model of a 
computer. 

Turing machines are useful in several ways. As an automaton, the Turing 
machine is the most general model. It accepts type-0 languages. It can also be 
used for computing functions. It turns out to be a mathematical model of partial 
recursive functions. Turing machines are also used for determining the un¬ 
decidability of certain languages and measuring the space and time complexity 
of problems. These are the topics of discussion in this chapter and some of the 
subsequent chapters. 

For formalizing computability, Turing assumed that, while computing, 
a person writes symbols on a one-dimensional paper (instead of a two- 
dimensional paper as is usually done) which can be viewed as a tape divided 
into cells. 

One scans the cells one at a time and usually performs one of the three 
simple operations, namely (i) writing a new symbol in the cell being currently 
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scanned, (ii) moving to the cell left of the present cell, and (iii) moving to the 
cell right of the present cell. With these observations in mind, Turing proposed 
his 'computing machine/ 

9.1 TURING MACHINE MODEL 

The Turing machine can be thought of as finite control connected to a R/W 
(read/write) head. It has one tape which is divided into a number of cells. The 
block diagram of the basic model for the Turing machine is given in Fig. 9.1. 



Each cell can store only one symbol. The input to and the output from the finite 
state automaton are effected by the R/W head which can examine one cell at 
a time. In one move, the machine examines the present symbol under the 
R/W head on the tape and the present state of an automaton to determine 

(i) a new symbol to be written on the tape in the cell under the R/W head, 

(ii) a motion of the RAY head along the tape: either the head moves one 
cell left (L). or one cell right (R), 

(iii) the next state of the automaton, and 

(iv) whether to halt or not. 

The above model can be rigorously defined as follows: 

Definition 9.1 A Turing machine M is a 7-tuple, namely (Q, E, F, S, q (y b , F), 
where 

1. Q is a finite nonempty set of states. 

2. F is a finite nonempty set of tape symbols, 

3. be T is the blank, 

4. Z is a nonempty set of input symbols and is a subset of T and b € I. 

5. <5 is the transition function mapping (tq , x) onto (q\ y, D) where D 
denotes the direction of movement of RAY head: D = L or R according 
as the movement is to the left or right. 

6. < 7 0 e 0 is the initial state, and 

7. F c Q is the set of final states. 
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Notes: (i) The acceptability of a stnng is decided by the reachability from the 
initial state to some final state. So the final states are also called the accepting 
states. 

(2) 8 may not be defined for some elements of Q x T. 

9.2 REPRESENTATION OF TURING MACHINES 

We can describe a Turing machine employing (i) instantaneous descriptions 
using move-relations, (ii) transition table, and (iii) transition diagram (transition 
graph). 

9.2.1 Representation by Instantaneous Descriptions 

'Snapshots' of a Turing machine in action can be used to describe a Turing 
machine. These give instantaneous descriptions' of a Turing machine. We have 
defined instantaneous descriptions of a pda in terms of the current state, the 
input string to be processed, and the topmost symbol of the pushdown store. 
But the input string to be processed is not sufficient to be defined as the ID of 
a Turing machine, for the R/W head can move to the left as well. So an ID of a 
Turing machine is defined in terms of the entire input string and the current 
state. 

Definition 9.2 An ID of a Turing machine M is a string aj3y , where [5 is the 
present state of M, the entire input string is split as ay the first symbol of y is 
the current symbol a under the R/W head and yhas all the subsequent symbols 
of the input string, and the string a is the substring of the input string formed 
by all the symbols to the left of a . 


EXAMPLE 9.1 


A snapshot of Turing machine is shown in Fig. 9.2. Obtain the instantaneous 
description. 


\ 

b 

a 4 


a 2 

a 1 

a 2 

a 2 

a i 

a 4 

a 2 

b 

b 




R/W head 


State 

93 



Fig. 9,2 A snapshot of Turing machine. 


Solution 

The present symbol under the R/W head is a x . The present state is c\y So a { 
is written to the right of q 3 . The nonblank symbols to the left of a { form the 
string a^a x aiaxa z a 2 , which is written to the left of q 3 . The sequence of nonblank 
symbols to the right of a x is Thus the ID is as given in Fig. 9.3. 
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Symbol under 
R/W head 


Fig. 9.3 Representation of ID. 


Notes: (1) For constructing the ID, we simply insert the current state in the 

input string to the left of the symbol under the R/W head. 

(2) We observe that the blank symbol may occur as part of the left or right 
substring. 

Moves in a TM 

As in the case of pushdown automata, 8(q, x) induces a change in ID of the 
Turing machine. We call this change in ID a move. 

Suppose S(q, x-) = ( p , v, L). The input string to be processed is x x x 2 ■ • . x m 
and the present symbol under the R/W head is x t . So the ID before processing 
Xi is 

* 1*2 . . . xi_ } qxi . . . x n 
After processing x h the resulting ED is 

x\ ■ ■ • x i-2 P x i-\y*M • • * Xn 
This change of ID is represented by 

X\X 2 • . . q x i • • • x n |— Xi . . . Xi _2 px { _ x yx M . . . x n 

If i - 1, the resulting ID is pyx 2 x 3 . . . x n . 

If 8(q , Xj) = (p, y, R ), then the change of ED is represented by 

X\x 2 . . . x^qx; . . . x n x x x 2 . . . x^ypxi+x . . . x n 

If i = n , the resulting ED is x\x 2 . . . x n _ x y p b . 

We can denote an ED by 7 ; - for some j. Ij )— l k defines a relation among EDs. 
So the symbol denotes the reflexive-transitive closure of the relation |—. 
In particular, Ij p- Ij. Also, if l x p- then we can split this as /] |— h \— 
I 2 \— ... |— I n for some EDs, / 2 , . . ., I n „\. 

Note: The description of moves by EDs is very much useful to represent the 
processing of input strings. 


9.2.2 Representation by Transition Table 

We give the definition of 8 in the form of a table called the transition table. If 
8(q , a) = (y, a. /3), we write a/3y under the a-column and in the r/-row. So if 
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we get apy in the table, it means that a is written in the current cell, /3 gives 
the movement of the head (L or R) and y denotes the new state into which the 
Turing machine enters. 

Consider, for example, a Turing machine with five states q u . . q 5 , where 

t/! is the initial state and q 5 is the (only) final state. The tape symbols are 0, 1 
and b . The transition table given in Table 9.1 describes 8l 


TABLE 9.1 Transition Table of a Turing Machine 


Present state 

b 

Tape symbol 

0 

1 

-*7i 

1 Lc ? 2 

0 Rq\ 


C /2 

bRq 3 

0Lq 2 

1 Lq 2 

Qz 


bRq 4 

bRq s 

94 

0Rq s 

ORq, 


© 

0Lq 2 




As in Chapter 3, the initial state is marked with -> and the final state 
with O. 


EXAMPLE 9.2 


Consider the TM description given in Table 9.1. Draw the computation 
sequence of the input string 00 . 

Solution 

We describe the computation sequence in terms of the contents of the tape and 
the current state. If the string in the tape is a\a 2 . . . 1 . . . a m and the TM 

in state q is to read aj + \, then we write a x a 2 ... cijq cij + \ . . . a„ r 
For the input string 006, we get the following sequence: 

< 7[006 |— 0 < 7[06 \— 00q { b f— 0 * 7 2 01 (— * 7 2 001 

|— q 2 b00l j— 6*73001 |— 66*7401 |— 669*741 j— 6691*746 
|— 66010*7 5 6601 < 7 2 00 |— 660 * 7 2 100 |— 66 * 7 2 0100 

|—6*7 2 60100 I— 66 * 7 3 0100 |— 666^100 |— 666[r/ 4 00 

(— 66610</ 4 0 j— 666100*746 (— 6661000*? 5 6 
|— 666100 * 7 2 00 I— 66610 < 7 2 000 [— bbblq 2 0000 
|— 666 < 7 : 10000 |— 66 * 7 2 610000 |— 666^10000 bbbbq 5 0000 

9.2.3 Representation by Transition Diagram 

We can use the transition systems introduced in Chapter 3 to represent Turing 
machines. The states are represented by vertices. Directed edges are used to 
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represent transition of states. The labels are triples of the form ( a ; /?, y), where 
a. j3. e T and ye {L. R}. When there is a directed edge from to qj with label 
(or, /?, y), it means that 

«) = (<7r A 7) 

During the processing of an input string, suppose the Turing machine enters 
q } and the R/W head scans the (present) symbol a. As a result, the symbol /3 
is written in the cell under the R/W head. The R/W head moves to the left or 
to the right, depending on y and the new state is q y 

Every edge in the transition system can be represented by a 5-tuple (q h a , 
/J, y < 7 /). So each Turing machine can be described by the sequence of 5-tuples 
representing all the directed edges. The initial state is indicated by —> and any 
final state is marked with O. 


EXAMPLE 9.3 


M is a Turing machine represented by the transition system in Fig. 9.4. Obtain 
the computation sequence of M for processing the input string 0011. 


(b, b, R) 



Solution 

The initial tape input is bOOllb. Let us assume that M is in state q x and the 
R/W head scans 0 (the first 0). We can represent this as in Fig. 9.5. The figure 
can be represented bv 

i 

bOOllb 

Q\ 

From Fig. 9.4 we see that there is a directed edge from to <72 with the label 
(0. v. R). So the current symbol 0 is replaced by x and the head moves right. 
The new state is qy Thus, w'e get 


i 

bxOllb 

Q2 
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The change brought about by processing the symbol 0 can be represented as 

i i 

bOOUb bxOllb 

<7i <72 



-! 

t, 1 

-, 

° J 

0 

1 

1 

b 




R/W head 


State 

<7l 



Fig. 9.5 TM processing 0011. 


The entire computation sequence reads as follows: 


i 

(0.x *R) 

- -i_> 

i 

(0.0.K) 

1 

bOOllb - 

bxOllb 

b.\01 lb 

<7i 


<72 


<72 


1 


1 

1 

(l.y.Z.) v 

bxOylb 

(O.O.Z.) v 

foOy1 b 
<74 

(.v-v-R) / nflvlfo 

) 

<73 

■. )► 

<7l 


(O.x.R) ( y.y.R ) , 17 (i-yX) 7 

- > bxxylb - > bxxvlb - > bxxyyb 


i 

vl l 
Q2 


i 

'cyy 

<73 


(y.y.L) ^ f x*x.R) 7 ^ (\\y.R) ^ . 

-* bxxyyb - > bxxyyb - > bxxyyb 




<75 


(y.y.R) 


i 

» bxxyyb 
<35 


(b,b,R) 


S' 


*bxxyybb 
' d6 


93 LANGUAGE ACCEPTABILITY BY TURING 
MACHINES 

let us consider the Turing machine M = (Q, F, 5, cjq, O* A string w in 
E* is said to be accepted by M if q 0 w p- a\pa 2 for some p e F and a 2 

€ r* 

M does not accept w if the machine M either halts in a non accepting state 
or does not halt. 
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It may be noted that though there are other equivalent definitions of 
acceptance by the Turing machine, we will be not discussing them in this text. 


EXAMPLE 9.4 


Consider the Turing machine M described by the transition table given in 
Table 9.2. Describe the processing of (a) Oil, (b) 0011, (c) 001 using IDs. 
Which of the above strings are accepted by Ml 


TABLE 9.2 Transition Tabie for Example 9.4 


Present state 



Tape symbol 



0 

1 

X 

y 

b 

<7i 

xRq 2 




bRqs 

<?2 

0Rq 2 

yl-q 3 


yRq 2 


<73 

0 Lq A 


xRq 5 

yLq 3 


<74 

0 Lq A 


xRq i 



95 




yxRqs 

bRq e 

® 







Solution 

(a) <7]011 |— xq *) 11 j— Cj%xyl j— xq^y 1 j — xyq^t 

As S(q 5 . 1) is not defined. M halts; so the input string Oil is not accepted. 

(b) r^OOll |— xq$ 11 |— xO^ll |— xqftyl |— qixOyl [— xq^Oyl. 

|—|— xy>r/21 |— xxqyyy |— xq&yy |— xxc isyy 

|— xx yQsy 1— a xyyq 5 b |— xxyybq 6 

M halts. As q $ is an accepting state, the input string 0011 is accepted by M. 

(c) ^001 |— .vq^Ol |— xOq^l |— -W 3 0v |— q$Qy 

|— xq | Ov j — xxqy\' J— xxyq 2 

M halts. As q 2 is not an accepting state, 001 is not accepted by M. 

9.4 DESIGN OF TURING MACHINES 

We now give the basic guidelines for designing a Turing machine. 

(i) The fundamental objective in scanning a symbol by the R/W head is 
to ‘know' what to do in the future. The machine must remember the 
past symbols scanned. The Turing machine can remember this by 
going to the next unique state. 

(ii) The number of states must be minimized. This can be achieved by 
changing the states only when there is a change in the written symbol 
or when there is a change in the movement of the R/W head. We shall 
explain the design by a simple example. 



Chapter 9: Turing Machines and Linear Bounded Automata B 285 


EXAMPLE 9.5 


Design a Turing machine to recognize all strings consisting of an even number 
of Fs. 

Solution 

The construction is made by defining moves in the following manner: 

(a) q { is the initial state. M enters the state q 2 on scanning 1 and writes b. 

(b) If M is in state q 2 and scans 1, it enters q , and writes b. 

(c) q ] is the only accepting state. 

So M accepts a string if it exhausts all the input symbols and finally is in 
state q\. Symbolically, 


M = ({</„ q 2 }, {L b }, {1, b }, , 

5, q, b. {<?,}) 

defined by Table 9.3. 


TABLE 9.3 Transition Table for Example 9.5 

Present state 

1 


bq 2 R 

Q2 

bq*R 


Let us obtain the computation sequence of 11. Thus, q x \l |— bq 2 1 |— bbq\. 
As qi is an accepting state. 11 is accepted. q x l 11 |— bq 2 ll |— bbq x 1 \~~ bbbq 2 . 
M halts and as q 2 is not an accepting state, 111 is not accepted by M. 


EXAMPLE 9.6 


Design a Turing machine over {1, b} which can compute a concatenation 
function over I - {!}. If a pair of words (w h vr 2 ) is the input, the output has 
to be wpc 2 . 

Solution 

Let us assume that the two words w x and w 2 are written initially on the input 
tape separated by the symbol b. For example, if viq - 11 , w 2 = 111 , then the 
input and output tapes are as shown in Fig. 9.6. 



□ 

0 

□ 

□ 

M 

b i 

0 

E 

E 

1 

E 

E 

b 


Fig. 9.6 Input and output tapes. 



We observe that the main task is to remove the symbol b . This can be done 
in the following manner: 

(a) The separating symbol b is found and replaced by 1. 
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(b) The rightmost 1 is found and replaced by a blank b. 

(c) The R/W head returns to the starting position. 

A computation is illustrated in Table 9,4. 

TABLE 9.4 Computation for 11 b\ 11 

( 7 0 11 b 111 |— 1q 0 1b1t1 |— 111Qi111 

|— 1111q 1 11 1— 11111 q-,1 1— 111111^5 |— 11111^2^ 
h- m\q^bb |— 111q 3 1 1 bb |— 11q 3 111jbjb [- Iq^mbb 
1— cfe11111M> |— <7 3 h11111/3jb h- ^Q f 11111 jb£) 

From the above computation sequence for the input string 11M11, we can 
construct the transition table given in Table 9.5. 

For the input string 1M, the computation sequence is given as 

q^lbl |— lq$bl |— 1 It/i 1 |— lli c\\b |— 11^ |— lq$lbb 

|— q$ 11 bb |— q*b\ 1 bb |— bqjllbb. 


TABLE 9,5 Transition Table for Example 9.6 


Present state 

Tape symbol 

1 b 

~^Qo 

1 R<7o 

IRqfi 

<7i 

IRqh 

bLq 2 

<72 

bLq z 

— 

<73 

1/-% 

bRq f 

© 

— 

— 


EXAMPLE 9.7 


Design a TM that accepts 

{0”V l \n > 1 }. 


Solution 

We require the following moves: 

(a) If the leftmost symbol in the given input string w is 0, replace it by x 
and move right till we encounter a leftmost 1 in w. Change it to y and 
move backwards. 

(b) Repeat (a) with the leftmost 0. If we move back and forth and no 0 or 
1 remains, move to a final state. 

(c) For strings not in the form OH”, the resulting state has to be nonfinal. 



Chapter 9: Turing Machines and Linear Bounded Automata S 287 


Keeping these ideas in our mind, we construct a TM M as follows: 

M= (Q, X, I\ 5, q 0 , b. F) 

where 

Q = {<?o- <?i> 42 ’ <?3> qf) 

f= {<?,} 

1 = { 0 . 1 } 

r = {0, 1, x. y, b } 

The transition diagram is given in Fig. 9.7. M accepts {0"1"| n >1}. The moves 
for 0011 and 010 are given below just to familiarize the moves of M to the 
reader. 



Fig. 9.7 Transition diagram for Example 9.7. 


g 0 0011 |—x^Oll |—.rO^ll |— xq 2 0yl 

|— # 2 x 0 yl |— xq$y\ j— xxcj\y\ |— xxyq x 1 
|— xxqyyy |— xq 2 xyy |— xxqqyy |— xxyqyy 
|— xxyyq^ = xxyyq^b |— xxyybqj? 

Hence 0011 is accepted by M. 

q^OlO |— xqi 10 |— q 2 *y0 |— xqqyO |— xyq $0 
As S(q 2 . 0) is not defined, M halts. So 010 is not accepted by M. 


-EXAMPLE 9.8 


Design a Turing machine M to recognize the language 

{1"2"3" [n >1}. 
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Solution 

Before designing the required Turing machine M, let us evolve a procedure for 
processing the input string 112233. After processing, we require the ID to be 
of the form bbbbbbq-. The processing is done by using five steps: 

Step 1 q } is the initial state. The R/W head scans the leftmost 1, replaces 1 
by b, and moves to the right. M enters q 2 . 

Step 2 On scanning the leftmost 2, the R/W head replaces 2 by b and moves 
to the right. M enters q 2 . 

Step 3 On scanning the leftmost 3, the R/W head replaces 3 by b , and moves 
to the right. M enters q 4 . 

Step 4 After scanning the rightmost 3, the R/W heads moves to the left until 
it finds the leftmost 1. As a result, the leftmost 1. 2 and 3 are replaced by b. 

Step 5 Steps 1-4 are repeated until all l's, 2 ! s and 3’s are replaced by blanks. 
The change of IDs due to processing of 112233 is given as 

<7j112233 |— Z*? : 12233 |— b\q 2 2233 |— blbq 3 233 |— b\b2q{33 
|— b\b2bq 4 3 |— b\b 2 q$b3 |— b\bq{lb3 [— h\q$b2b2 |— bq 5 lb2b3 
|— q§b\b2b3 |— bq\lb2b3 |— bbq 2 b2b3 |— bbbq 2 2b3 
|— bbbbq 2 b3 |— bbbbbq{3 |— bbbbbbq 4 b j— bbbbbq 1 bb 

Thus. 

q\ 112233 qibbbbbb 

As q 2 is an accepting state, the input string 112233 is accepted. 

Now we can construct the transition table for M. It is given in Table 9.6. 


TABLE 9.6 Transition Table for Example 9.7 


Present state 


Input tape symbol 



i 

2 

3 

b 

-»<7i 

bRq 2 



bRq i 

Q2 

1*92 

bRq 3 


bRq 2 

Qz 


2*93 

bRq 4 

bRq 3 

94 



3/-q> 5 

bLq-, 

95 

V-q-o 

2 Lq s 


bLq s 

cn [rC\ 

V-q 6 



bRq i 


It can be seen from the table that strings other than those of the form 0"1 ;I 2" 
are not accepted. It is advisable to compute the computation sequence for 
strings like 1223, 1123, 1233 and then see that these strings are rejected by M. 
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9.5 DESCRIPTION OF TURING MACHINES 

In the examples discussed so far, the transition function 8 was described as a 
partial function (function 8 : Q x T g x T x {L i?} is not defined for all 
(i q , x)) by spelling out the current state, the input symbol, the resulting state, the 
tape symbol replacing the input symbol and the movement of R/W head to the 
left or right. We can call this a formal description of a TM. Just as we have the 
machine language and higher level languages for a computer, we can have a 
higher level of description, called the implementation description. In this case 
we describe the movement of the head, the symbol stored etc. in English. For 
example, a single instruction like ‘move to right till the end of the input string’ 
requires several moves. A single instruction in the implementation description 
is equivalent to several moves of a standard TM (Hereafter a standard TM 
refers to the TM defined m Definition 9.1). At a higher level we can give 
instructions in English language even without specifying the state or transition 
function. This is called a high-level description. 

In the remaining sections of this chapter and later chapters, we give 
implementation description or high-level description. 


9.6 TECHNIQUES FOR TM CONSTRUCTION 

In this section we give some high-level conceptual tools to make the 
construction of TMs easier. The Turing machine defined in Section 9.1 is called 
the standard Turing machine. 

9.6.1 Turing Machine with Stationary Head 

In the definition of a TM we defined 8(q, a) as (q\ y, D) where D - L or R. 
So the head moves to the left or right after reading an input symbol. Suppose, 
we want to include the option that the head can continue to be in the same cell 
for some input symbol. Then we define 8(q , a) as (q\ y, S). This means that 
the TM, on reading the input symbol a , changes the state to q and writes y in 
the current cell in place of a and continues to remain in the same cell. In terms 
of IDs, 

wqcix |— wc/yx 

Of course, this move can be simulated by the standard TM with two moves, 
namely 

wqcix |— wyq'x |— wc/yx 

That is, 8{q , a) = (q\ y, S ) is replaced by S(cj. a) = (q", y, R) and 8(q \ X) = 
(q . v, L) for any tape symbol X. 

Thus in this model 8{cp a) - (q\ y, D) where D = L, R or S . 
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9.6.2 Storage in the State 

We are using a state, whether it is of a FA or pda or TM, to ‘remember’ things. 
We can- use a state to store a symbol as well. So the state becomes a pair 
( q , a) where q is the state (in the usual sense) and a is the tape symbol stored 
in ( q , a). So the new set of states becomes QxF 


EXAMPLE 9.9 


Construct a TM that accepts the language 0 1* + 1 0*. 

Solution 

We have to construct a TM that remembers the first symbol and checks that it 
does not appear afterwards in the input string. So we require two states, q 0 , q 
The tape symbols are 0, 1 and b. So the TM, having the ‘storage facility in 
state’, is 

M = ({ c]q, q x ) X {0. 1, ft}. {0, 1}, {0, 1, b}. 8, [<?„, b). {[«?!, b ]}) 

We describe S by its implementation description. 

1. In the initial state, M is in q 0 and has b in its data portion. On seeing 
the first symbol of the input sting w, M moves right, enters the state 
q { and the first symbol, say a, it has seen. 

2. M is now in [q h a], (i) If its next symbol is b , M enters [q h b], an 
accepting state, (ii) If the next symbol is a . M halts without reaching 
the final state (i.e. S is not defined), (lii) If the next symbol is a 
(a = 0 if a = 1 and a = 1 if a = 0), M moves right without changing 
state. 

3. Step 2 is repeated until M reaches [q h b] or halts ( 8 is not defined for 
an input symbol in w), 

9.6.3 Multiple Track Turing Machine 

In the case of TM defined earlier, a single tape was used. In a multiple track 
TM. a single tape is assumed to be divided into several tracks. Now the tape 
alphabet is required to consist of A:-tuples of tape symbols, k being the number 
of tracks. Hence the only difference between the standard TM and the TM with 
multiple tracks is the set of tape symbols. In the case of the standard Turing 
machine, tape symbols are elements of F; in the case of TM with multiple track, 
it is r A . The moves are defined in a similar way. 


9.6.4 Subroutines 

We know that subroutines are used in computer languages, when some task has 
to be done repeatedly. We can implement this facility for TMs as well. 
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First, a TM program for the subroutine is written. This will have an initial 
state and a 'return* state. After reaching the return state, there is a temporary 
halt. For using a subroutine, new states are introduced. When there is a need 
for calling the subroutine, moves are effected to enter the initial state for the 
subroutine (when the return state of the subroutine is reached) and to return to 
the main program of TM. 

We use this concept to design a TM for performing multiplication of two 
positive integers. 


EXAMPLE 9.10 


Design a TM which can multiply two positive integers. 

Solution 

The input (m, n). m, n being given, the positive integers are represented by 
M starts with O'” 10" in its tape. At the end of the computation, 
0 lwl (mn in unary representation) surrounded by V s is obtained as the ouput. 
The major steps in the construction are as follows: 

1. 0"T0M is placed on the tape (the output will be written after the 
rightmost 1). 

2. The leftmost 0 is erased, 

3. A block of n 0's is copied onto the right end. 

4. Steps 2 and 3 are repeated m times and 10'” 10'”” is obtained on the 
tape. 

5. The prefix 10”'l of 10”'10'”” is erased, leaving the product ran as the 
output. 

For every 0 in O'". 0” is added onto the right end. This requires repetition 
of step 3. We define a subroutine called COPY for step 3. 

For the subroutine COPY, the initial state is q\ and the final state is r/ 5 . 8 
is given by the transition table (see Table 9.7). 


TABLE 9,7 Transition Table for Subroutine COPY 


State 


Tape symbol 


0 

t 

2 

b 

9i 

q 2 2R 

q A V- 

— 

— 

92 

q 2 0R 

q 2 1R 

— 

9s0 L 

93 

9 3 0L 

93lL 

q^R 

— 

94 

— 

Q5IR 

q 4 0L 

— 

95 

— 

— 

— 

— 


The Turing machine M has the initial state q 0 . The initial ID for M is 
<r/ 0 0'"10'T. On seeing 0, the following moves take place (q (l is a state of M). 
f/oO”'10' ! l |— bqfl"- 1 10"! P- bO m -'q 6 lO’'l (— c h is the initial state 
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of COPY. The TM M x performs the subroutine COPY. The following moves 
take place for Mj: |—2^0"“ 1 1 p- |—20"“ l t/ 3 10 p_ 2^0"^ 10. 

After exhausting 0's. q x encounters 1. M\ moves to state c/ 4 . All 2’s are 
converted back to 0’s and M\ halts in q 5 . The TM M picks up the computation 
by starting from q 5 . The q 0 and q 6 are the states of M. Additional states are 
created to check whether each 0 in 0 m gives rise to 0 ; " at the end of the 
rightmost 1 in the input string. Once this is over, M erases 10"! and finds 0™ 
in the input tape. 

M can be defined by 

M = ({< 70 - <7b • • ■. <7i2h {0. 1}, {0. 1, 2, b}, 3, q 0 , b, {q ]2 }) 
where 8 is defined by Table 9.8. 


TABLE 9.8 Transition Table for Example 9.10 



0 

1 

2 

b 

Qo 

q e bR 

— 

— 

— 

Qe 

q e 0R 

q,1R 

— 

— 

Qs 

q?0L 

~ 

— 

— 

Qi 

— 

L 

— 

— 

Qb 

QqOL 


— 

<hobR 

Qs 

QqOL 

— 

— 

q 0 bR 

9 io 

— 

q«bR 

— 

— 

Qii 

q Vi bR 

q V2 bR 

— 

— 


Thus M performs multiplication of two numbers in unary representation. 

97 VARIANTS OF TURING MACHINES 

The Turing machine we have introduced has a single tape. S(q , a) is either a 
single triple (p. y. D), where D = R or L, or is not defined. We introduce two 
new models of TM: 

(i) a TM with more than one tape 

(ii) a TM where S(q, a ) = {( p x , yj, Dj), [p 2 , > 2 * D 2 ), . . ( p n y n D r )}. The 
first model is called a multitape TM and the second a nondeterministic 
TM. 

9.7.1 Multitape Turing Machines 

A multitape TM has a finite set Q of states, an initial state q& a subset F of Q 
called the set of final states, a set P of tape symbols, a new symbol b, not in 
P called the blank symbol. (We assume that IcF and b & X.) 
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There are k tapes, each divided into cells. The first tape holds the input 
string w. Initially, all the other tapes hold the blank symbol. 

Initially the head of the first tape (input tape) is at the left end of the input 
w. All the other heads can be placed at any cell initially. 

5 is a partial function from Q x T k into Q x F k x {L, R . S} k . We use 
implementation description to define <5. Figure 9.8 represents a multitape TM. 
A move depends on the current state and k tape symbols under k tape heads. 



Fig. 9.8 Multitape Turing machine. 


In a typical move: 

(i) M enters a new state, 

(ii) On each tape, a new symbol is written in the cell under the head. 

(iii) Each tape head moves to the left or right or remains stationary. The 
heads move independently: some move to the left, some to the right 
and the remaining heads do not move. 

The initial ID has the initial state q 0t the input string w in the first tape 
(input tape), empty strings of Z/s in the remaining k- 1 tapes. An accepting ID 
has a final state, some strings in each of the k tapes. 

Theorem 9.1 Every language accepted by a multitape TM is acceptable by 
some single-tape TM (that is, the standard TM). 

Proof Suppose a language L is accepted by a k -tape TM M. We simulate M 
with a single-tape TM with 2k tracks. The second, fourth. .... (2£)th tracks hold 
the contents of the k-tapes. The first, third, . . (2k - l)th tracks hold a head 

marker (a symbol say X) to indicate the position of the respective tape head. 
We give an 'implementation description* of the simulation of M with a single¬ 
tape TM M\. We give it for the case k = 2. The construction can be extended 
to the general case. 

Figure 9.9 can be used to visualize the simulation. The symbols A 2 and B 5 
are the current symbols to be scanned and so the headmarker X is above the two 
symbols. 
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Fig. 9.9 Simulation of multitape TM. 


Initially the contents of tapes 1 and 2 of M are stored in the second and 
fourth tracks of M x . The headmarkers of the first and third tracks are at the cells 
containing the first symbol. 

To simulate a move of JVL the 2£-track TM M x has to visit the two 
headmarkers and store the scanned symbols in its control. Keeping track of the 
headmarkers visited and those to be visited is achieved by keeping a count and 
storing it in the finite control of M x . Note that the finite control of M x has also 
the information about the states of M and its moves. After visiting both head 
markers, M x knows the tape symbols being scanned by the two heads of M. 
Now M { revisits each of the headmarkers: 

(i) It changes the tape symbol in the corresponding track of M x based 
on the information regarding the move of M corresponding to the state 
(of M) and the tape symbol in the corresponding tape M. 

(ii) It moves the headmarkers to the left or right. 

(iii) M | changes the state of M in its control. 

This is the simulation of a single move of M. At the end of this, M x is ready 
to implement its next move based on the revised positions of its headmarkers 
and the changed state available in its control. 

M x accepts a string \v if the new state of M, as recorded in its control at 
the end of the processing of u\ is a final state of M. 

Definition 9.3 Let M be a TM and w an input string. The running time of M 
on input h\ is the number of steps that M takes before halting. If M does not 
halt on an input string u\ then the running time of M on w is infinite. 

Note: Some TMs may not halt on all inputs of length n. But we are interested 
in computing the running time, only when the TM halts. 

Definition 9.4 The time complexity of TM M is the function T(n), n being the 
input size, where T(n) is defined as the maximum of the running time of M over 
all inputs w of size n. 

Theorem 9.2 If M x is the single-tape TM simulating multitape TM M, then 
the time taken by M x to simulate n moves of M is 0(n 2 ). 
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Proof Let M be a &-tape TM. After n moves of M, the head markers of M 1 
will be separated by 2n cells or less. (At the worst, one tape movement can be 
to the left by n cells and another can be to the right by n cells. In this case the 
tape headmarkers are separated by 2n cells. In the other cases, the ‘gap’ 
between them is less). To simulate a move of M, the TM M x must visit all the 
k headmarkers. If M starts with the leftmost headmarker, M { will go through all 
the headmarkers by moving right by at most 2 n cells. To simulate the change 
in each tape. M { has to move left by at most 2 n cells; to simulate changes in 
k tapes, it requires at most two moves in the reverse direction for each tape. 

Thus the total number of moves by M x for simulating one move of M is 
atmost 4 n + 2 k. (2n moves to right for locating all headmarkers, 2 n + 2k moves 
to the left for simulating the change in the content of k tapes.) So the number 
of moves of M x for simulating n moves of M is n(4n + 2k). As the constant k 
is independent of ;?, the time taken by M x is O(rr). 

9.7.2 Nondeterministic Turing Machines 

In the case of standard Turing machines (hereafter we refer to this machine as 
deterministic TM). 8{q x . a) was defined (for some elements of Q x F) as an 
element of Q x T x {L, R}. Now we extend the definition of 5. In a 
nondeterministic TM. 8(q\, a) is defined as a subset of Q x T x {L, R}. 

Definition 9.5 A nondeterministic Turing machine is a 7-tuple (Q , Z, T, <5, q$ , 
b, F) where 

1 . O is a finite nonempty set of states 

2 . T is a finite nonempty set of tape symbols 

3. be r is called the blank symbol 

4. Z is a nonempty subset of T. called the set of input symbols. We 
assume that b £ Z. 

5. <70 is the initial state 

6 . F c Q is the set of final states 

7. <5 is a partial function from Q x F into the power set of Q x T x 
{L. R}. 

Note: If q e Q and .v € T and 8(q. x) = {(q b y b D } ). (q 2 , D 2 ), . . 
(q n . y, r D n )} then the NTM can chose any one of the actions defined by 
(q h V/, D;) for i = 1, 2. . . .. n. 

We can also express this in terms of |— relation. If 8{q , x) = {(q h y h D f )| 
i=L2....,n| then the ID zqxw can change to any one of the n IDs specified 
by the 7 ?-element set 8(q , a). 

Suppose S(q, x) = {(q h y i? L), (q 2 , y 2 . R ). (q 2 , y 3 , L)}. Then 

• . . Zi$XZk+\ ••*-/?!— -IO • ■ • ^k-\ c i\<y\~-k+\ * ■ • -H 
or 

-1-2 * * • ZkCjXZ M . . . z n j— :io ■ • • -DC^/2-A+l ■■•-/? 

-1-2 • ■ • — -1-2 • * ■ -A-ri/3-A>3-/:+l ■ • • £/r 


or 
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So on reading the input symbol, the NTM M whose current ED is z\Z 2 * • * 
z^qxzk+i • . • z n can change to any one of the three IDs given earlier. 

Remark When 8(q, x) = {(q h y P D t ) \ i = 1, 2, . . .. n} then NTM chooses any 
one of the n triples totally (that is, it cannot take a state from one triple, another 
tape symbol from a second triple and a third D(L or R) from a third triple, etc. 

Definition 9.6 w e X* is accepted by a nondeterministic TM M if q$w 
xq f y for some final state cjj. 

The set of all strings accepted by M is denoted by T(M). 

Note: As in the case of NDFA, an ID of the form xqy (for some q € F) may 
be reached as the result of applying the input string w\ But w is accepted by M 
as long as there is some sequence of moves leading to an ED with an accepting 
state. It does not matter that there are other sequences of moves leading to an 
ED with a nonfinal state or TM halts without processing the entire input string. 

Theorem 9.3 If M is a nondeterministic TM. there is a deterministic TM M\ 
such that T(M) = T{M X ). 

Proof We construct M 1 as a multitape TM. Each symbol in the input string 
leads to a change in ID. M x should be able to reach all EDs and stop when an 
ED containing a final state is reached. So the first tape is used to store IDs of 
M as a sequence and also the state of M. These EDs are separated by the symbol 
* (included as a tape symbol). The current ED is known by marking an x along 
with the ED-separator * (The symbol * marked with x is a new tape symbol.) 
All IDs to the left of the current one have been explored already and so can be 
ignored subsequently. Note that the current ID is decided by the current input 
symbol of w. 

Figure 9.10 illustrates the deterministic TM M x . 



Fig. 9.10 The deterministic TM simulating M. 


To process the current ED. M\ performs the following steps. 

1. M\ examines the state and the scanned symbol of the current ED. Using 
the knowiedge of moves of M stored in the finite control of M j, M x 
checks whether the state in the current ED is an accepting state of M. 
In this case M\ accepts and stops simulating M. 
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2. If the state q say in the current ID xqciy is not an accepting state of M, 
and S(q, a) has k triples, M x copies the ID xqciy in the second tape and 
makes k copies of this ID at the end of the sequence of IDs in tape 2. 

3. M] modifies these k IDs in tape 2 according to the k choices given by 
8(q. a). 

4. Mi returns to the marked current ID. erases the mark x and marks the 
next ID-separator * with x (to the * which is to the left of the next ID 
to be processed). Then M x goes back to step 1. 

M\ stops when an accepting state of M is reached in step 1. 

Now M { accepts an input string w only when it is able to find that M has 
entered an accepting state, after a finite number of moves. This is clear from 
the simulated sequence of moves of M x (ending in step 1) 

We have to prove that M x will eventually reach an accepting ID (that is, 
an ID having an accepting state of M) if M enters an accepting ID after n 
moves. Note each move of M is simulated by several moves of M x . 

Let in be the maximum number of choices that M has for various (< 7 , a)’s. 
(It is possible to find m since we have only finite number of pairs in Q x T.) 
So for each initial ED of M, there are at most m IDs that M can reach after one 
move, at most nr IDs that M can reach after two moves, and so on. So 
corresponding to n moves of M, there are at most 1 + m + nr + ■ • • + m n moves 
of Mj. Hence the number of IDs to be explored by M\ is at most nm u . 

We assume that M x explores these IDs. These IDs have a tree structure 
having the initial ID as its root. We can apply breadth-first search of the nodes 
of the tree (that is. the nodes at level 1 are searched, then the nodes at level 2 , 
and so on.) If M reaches an accepting ID after n moves, then M x has to search 
atmost nnf IDs before reaching an accepting ID. So, if M accepts w, then M x 
also accepts vr (eventually). Hence T(M) = T(M X ). 

9.8 THE MODEL OF LINEAR BOUNDED AUTOMATON 

This model is important because (a) the set of context-sensitive languages is 
accepted by the model, and (b) the infinite storage is restricted in size but not 
in accessibility to the storage in comparison with the Turing machine model. It 
is called the linear bounded automaton (LBA) because a linear function is used 
to restrict {to bound) the length of the tape. 

In this section we define the model of linear bounded automaton and 
develop the relation between the linear bounded automata and context-sensitive 
languages. It should be noted that the study of context-sensitive languages is 
important from practical point of view because many compiler languages lie 
between context-sensitive and context-free languages. 

A linear bounded automaton is a nondeterministic Turing machine which 
has a single tape whose length is not infinite but bounded by a linear function 
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of the length of the input string. The models can be described formally by the 
following set format: 

M = (Q. I, r. <5. q 0 , b, (fc $, F) 

All the symbols have the same meaning as in the basic model of Turing 
machines with the difference that the input alphabet £ contains two special 
symbols (f and $. (J is called the left-end marker which is entered in the left¬ 
most cell of the input tape and prevents the R/W head from getting off the left 
end of the tape. $ is called the right-end marker which is entered in the right¬ 
most cell of the input tape and prevents the R/W head from getting off the right 
end of the tape. Both the endmarkers should not appear on any other cell within 
the input tape, and the R/W head should not print any other symbol over both 
the endmarkers. 

Let us consider the input string vv with |w| = n - 2. The input string w can 
be recognized by an LBA if it can also be recognized by a Turing machine 
using no more than kn cells of input tape, where k is a constant specified in the 
description of LBA. The value of k does not depend on the input string but is 
purely a property of the machine. Whenever we process any string in LBA, we 
shall assume that the input string is enclosed within the endmarkers (£ and $. 
The above model of LBA can be represented by the block diagram of Fig. 9.11. 
There are two tapes: one is called the input tape, and the other, working tape. 
On the input tape the head never prints and never moves to the left. On the 
working tape the head can modify the contents in any way, without any 
restriction. 


n ceils 


9 

Input 

tape 








□ 

□ 

$ 



Working tape 


Fig. 9.11 Model of linear bounded automaton. 


In the case of LBA. an ID is denoted by ( q , w\ &), where q e Q, w e T 
and k is some integer between 1 and n. The transition of IDs is similar except 
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that k changes to k - 1 if the RAV head moves to the left and to k + 1 if the 
head moves to the right. 

The language accepted by LBA is defined as the set 

{w 6 (L — {(£, $}|(<70' 1) p- ^ 0 

for some q e F and for some integer / between 1 and n). 

Note: As a null string can be represented either by the absence of input string 
or by a completely blank tape, an LBA may accept the null string. 


9.8.1 Relation Between LBA and Context-sensitive 
Languages 

The set of strings accepted by nondeterministic LBA is the set of strings 
generated by the context-sensitive grammars, excluding the null strings. Now 
we give an important result: 

IfL is a context-sensitive language, then L is accepted by a linear bounded 
automaton . The converse is also true. 

The construction and the proof are similar to those for Turing machines 
with some modifications. 

9.9 TURING MACHINES AND TYPE 0 GRAMMARS 

In this section we construct a type 0 grammar generating the set accepted by 
a given Turing machine M. The productions are constructed in two steps. In 
step 1 we construct productions which transform the string [q { $ \v$] into the 
string [qib], where q\ is the initial state, q 2 is an accepting state, (f is the left- 
endmarker, and $ is the right-endmarker. The grammar obtained by applying 
step 1 is called the transformational grammar . In step 2 we obtain inverse 
production rules by reversing the productions of the transformational grammar 
to get the required type 0 grammar G. The construction is in such a way that 
vr is accepted by M if and only if w is in L(G). 


9.9.1 Construction of a Grammar Corresponding 
to TM 

For understanding the construction, we have to note that a transition of ED 
corresponds to a production. We enclose IDs within brackets. So acceptance of 
n by M corresponds to the transformation of initial ID [q { (£ w $] into [g 2 fr]* 
Also, the 'length’ of ED -may change if the RAV head reaches the left-end or the 
right-end, i.e. when the left-hand side or the right-hand side bracket is reached. 
So we get productions corresponding to transition of EDs with (i) no change in 
length, and (ii) change in length. We assume that the transition table is given. 
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We now describe the construction which involves two steps: 

Step 1 (i) No change in length of IDs: (a) Right move . a k Rq f corresponding 

to q r row and ^-column leads to the production 

Qflj a k c il 

(b) Left move . ci k Lq l corresponding to q r xow and c^-column yields several 
productions 

a m q-, a j -> C],a m a k for all a m e T 

(ii) Change in length o/IDs: (a) Left-end. a k Lq, corresponding to <? r row 
and ^-column gives 

[qflj \cjiba k 

When b occurs next to the left-bracket, it can be deleted. This is achieved 
by including the production [b —> [. 

(b) Right-end. When b occurs to the left of ], it can be deleted. This is 
achieved by the production 

af] aj] for all aj e T 

When the R/W head moves to the right of ], the length increases. 
Corresponding to this we have a production 

qii -> cjfb] for all q { e Q 

(iii) Introduction of endmarkers. For introducing endmarkers for the input 
string, the following productions are included: 

ai —> [q\ <$ (i\ for a ,* e T, a } * b 

a { —> a,$] for all a { e T, a { ^ b 

For removing the brackets from [q 2 b], we include the production 

[q 2 b\ -> 5 

Recall that q ] and q 2 are the initial and final states, respectively. 

Step 2 To get the required grammar, reverse the arrows of the productions 
obtained in step 1. The productions w'e get can be called inverse productions . 
The new grammar is called the generative grammar . We illustrate the 
construction with an example. 


EXAMPLE 9.11 


Consider the TM described by the transition table given in Table 9.9. Obtain 
the inverse production rules. 

Solution 

In this example. q } is both initial and final. 

Step 1 (i) Productions corresponding to right moves 

qf §-—^ (£(/{, Q ^ bq 2 , q 2 \ —^ bq\ 


(9.1) 
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(ii) (a) Productions corresponding to left-end 

[b [ (9.2) 

(b) Productions corresponding to rig 1 it-end 

bb] -> bl 1 b] 1], q x ] -> cpbl q 2 ] ^ q 2 b] (9.3) 

(iii) 1 — > 1 1$], Uj}b] —> S (9.4) 


TABLE 9.9 Transition Table for Example 9.11 


Present state 

$ 

b 

1 


$Rq-\ 


bRq 2 




t>Rq- 


Step 2 The inverse productions are obtained by reversing the arrows of the 
productions (9.1)-(9.4). 

(fel “> q\$ bq 2 q\\, bq\ q 2 1 

[ [b, b] -» to], 1] -> lb] 

q\b —> q \], r/ 2 ^ —» ^]- [?i$l 1 

IS] 1, 5 -4 [qf] 

Thus we have shown that there exists a type 0 grammar corresponding to 
a Turing machine. The converse is also true (we are not proving this), i.e. given 
a type 0 grammar G, there exists a Turing machine accepting L(G). Actually, 
the class of recursively enumerable sets, the type 0 languages, and the class of 
sets accepted by TM are one and the same. We have shown that there exists 
a recursively enumerable set which is not a context-sensitive language (see 
Theorem 4.4). As a recursive set is recursively enumerable, Theorem 4.4 gives 
a type 0 language which is not type 1. Hence, £ csl c £§ (cf Property 4, 
Section 4.3) is established. 


9.10 LINEAR BOUNDED AUTOMATA AND LANGUAGES 

A linear bounded automaton M accepts a string w if. after starting at the initial 
state with R/W head reading the left-endmarker, M halts over the right-end- 
marker in a final state. Otherwise, w is rejected. 

The production rules for the generative grammar are constructed as in the 
case of Turing machines. The following additional productions are needed in 
the case of LB A. 

a^fS —> q f $ for all e T 

(fef$ $7/» c if 
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EXAMPLE 9.12 


Find the grammar generating the set accepted by a linear bounded automaton 
M whose transition table is given in Table 9.10. 


TABLE 9.10 Transition Tabie for Example 9.12 


Present state 


Tape input symbol 


<P 

$ 

0 

1 

->Qi 

$Rqi 


14b 

0 Rq 2 

Q2 

<$Rq 4 


■\Rq z 

ILg, 

Qz 


$4i 

1RC?3 

1Rqr 3 . 

© 


Halt 

0Lq 4 

0Rq 4 


Solution 

Step 1 (A) (i) Productions corresponding to right moves . The seven right 

moves in Table 9.10 give the following productions: 

(Jx/]. 0 —> 1<?3 

cj\ 1 —^ 0<72* < 73 1 ^ k /3 (9.5) 

q 2 § —> (f<74, g 4 l —* 0^4 

qfd —^ 1<73 

(ii) Productions corresponding to left moves. There are four left moves in 
Table 9.10. Each left move yields four productions (corresponding to the four 
tape symbols). These are: 

(a) lLq 2 corresponding to #prow and 0 -column gives 

(M —» ^(fl. S^iO —> < 72 $T 0q { 0 -A r/201, 1<7]0 —> q 2 \\ (9.6) 

(b) 1 Lq { corresponding to < 7 r row and 1-column yields 

(£< 7^1 —^ < 7 j(£l, S<7"*1 —^ < 7 j$l. 0<7~T —^ <7i^ ? l^jhl —^ < 7.1 H (9.7) 

(c) corresponding to < 73 -row and $-column gives 

(£<73$ —> <7i(f $, $<73$ —^ r/jSS, 0^3$ —^ r/jOS, 1*73$ —^ <71 f$ ( 9 . 8 ) 

(d) 0 L< 7 4 corresponding to < 7 4 -row and 0 -column yields 

(t? 4 0 —^ ^(£0, $<740 —^ ^4$0, 0^0 —^ ^ 00 , 1<7 4 0 —^ < 74 19 (9.9) 

(B) There are no productions corresponding to change in length. 

(C) The productions for introducing the endmarkers are 


4 ^ [ 9l <t<t 



$ [</iC£$, 

$ -»$$] 

(9.10) 

0 -> [ 9l (£o, 

0 -*■ 0$] 


1 [<7i^£ 1' 

1 —> 1$] 



[<7 4 ] —» S 


(9.11) 
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(D) The LBA productions are 

(f^4$ —> <74$, $<74 

$<74$ —* Cj^S, cf <74 —> Cj 4 (9.12) 

0r/4$ -4 ^/4$. 

k/4$ —> r/4$ 

Step 2 The productions of the generative grammar are obtained by reversing 
the arrows of productions given by (9.5)-(9.12). 

9.11 SUPPLEMENTARY EXAMPLES 


EXAMPLE 9.13 


Design a TM that copies strings of l’s. 

Solution 

We design a TM so that we have vvvr after copying w e {1}*. Define M by 
M = ({c/ 0 - <7h <7> <??}• UK U- b\, <5, q 0 , b, {</ 3 }) 
where <5 is defined by Table 9.11. 


TABLE 9.11 Transition Table for Example 9.13 


Present state 


Tape symbol 


1 

b 

a 

Qo 

q 0 aR 

Q-,bL 

— 

Qi 

q<M- 

q 3 bR 

Q2^R 

Q2 

q 2 1R 

q^L 

— 

Qo 

— 

— 

— 


The procedure is simple. 

M replaces every 1 by the symbol a. Then M replaces the rightmost a by 
1. It goes to the right end of the string and writes a 1 there. Thus M has added 
a 1 for the rightmost 1 in the input string w. This process can be repeated. 

M reaches q { after replacing all Vs by a s and reading the blank at the end 
of the input string. After replacing a by 1, M reaches q 2 . M reaches q 3 at the 
end of the process and halts. If \v = IT than we have i 2n at the end of the 
computation. A sample computation is given below. 

<7ol 1 «<7ol |— aaq 0 b \— acj x a 

|— a\q 2 b I— aq\W |— q x a\\ 

|— 1^11 (— llq 2 l |— lllq 2 b 

f— Uq 2 U |— lq x lll 

|— ^llll |— q { b\\U |— <? 3 1111 
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EXAMPLE 9.14 


Construct a TM to accept the set L of all strings over {0,1} ending with 010. 

Solution 

L is certainly a regular set and hence a deterministic automaton is sufficient to 
recognize L. Figure 9.12 gives a DFA accepting L. 



Fig. 9.12 DFA for Example 9.14. 


Converting this DFA to a TM is simple. In a DFA Af, the move is always to 
the right. So the TM's move will always be to the right. Also M reads the input 
symbol and changes state. So the TM M x does the same; it reads an input 
symbol, does not change the symbol and changes state. At the end of the 
computation, the TM sees the first blank b and changes to its final state. The 
initial ID of M 1 is q Q w. By defining S(q Qt b) = (q h b, R), M\ reaches the initial 
state of M. M\ can be described by Fig. 9.13. 


(0, 0, R) 



Fig. 9.13 TM for Example 9.14. 


Note: q$ is the unique final state of M\. By comparing Figs. 9.12 and 9.13 it 
is easy to see that strings of L are accepted by Mj. 


EXAMPLE 9.15 


Design a TM that reads a string in {0 ? 1}* and erases the rightmost symbol. 

Solution 

The required TM M is given by 

M = ({go. < 71 , <?2, < 73 , qi\- {0, 1}, {0. 1, b], 6. q 0 . b, {q 4 }) 
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where S is defined by 


0<q (j , 0) = 

(<7b 

0. 

R) 

8(q 0 , 

1) = 

= to. 

l, 

R) 

(Ri) 

S(qi. 0) = 

: (<?b 

o, 

R) 

%i. 

1) = 

= (<7i. 

l, 

R) 

(R : ) 

8(qi, b) - 

: (q 2 - 

b. 

L) 






(R 3 ) 

8(q 2 , 0) = 

- to. 

b. 

L) 

S(q 2 , 

1) = 

= to. 

b. 

L) 

(R 4 ) 

Oo 

•ST 

O 

ii 

: to. 

0, 

L) 

S(q 2 , 

1) = 

= to. 

1, 

L) 

(R 5 ) 

S(q 2 . b) - 

: to. 

b. 

R) 






(Rfi) 


Let w be the input string. By (Rj) and (R 2 ), M reads the entire input string 
w. At the end, M is in state q x . On seeing the blank to the right of w, M reaches 
the state q 2 and moves left. The rightmost string in w is erased (by (R 4 )) and 
the state becomes q 2 . Afterwards M moves to the left until it reaches the left- 
end of w. On seeing the blank b to the right of \\\ M changes its state to # 4 , 
which is the final state of M. From the construction it is clear that the rightmost 
symbol of w is erased. 


EXAMPLE 9.16 


Construct a TM that accepts L - {0"' | n > 0}. 

Solution 

Let w be an input string in {0}*. The TM accepting L functions as follows: 

1. It writes b (blank symbol) on the leftmost 0 of the input string vv. This 
is done to mark the left-end of w. 

2. M reads the symbols of w from left to right and replaces the alternate 
0‘s with Ms. 

3. If the tape contains a single 0 in step 2. M accepts w. 

4. If the tape contains more than one 0 and the number of 0’s is odd in 
step 2, M rejects vt\ 

5. M returns the head to the left-end of the tape (marked by blank b in 
step 1). 

6. M goes to step 2. 

Each iteration of step 2 reduces \v to half its size. Also whether the number 
of CTs seen is even or odd is known after step 2. If that number is odd and 
greater than 1, w cannot be 0"’ (step 4). In this case M rejects w. If the number 
of 0‘s seen is 1 (step 3), M accepts w (In this case 0 2 ' is reduced to 0 in 
successive stages of step 2). 

We define M by 

M = (Ulo- <7b <72- <7?- cif- <?/}, {0}, {0, X, b}, S. q 0 , b. {q f }) 
where <5 is defined by Table 9.12. 




306 B Theory of Computer Science 


TABLE 9.12 

Transition Table for Example 9.16 


Present state 


Tape symbol 


0 

b 

A* 

Qq 

bRq, 

bRq t 

xRq, 

Q: 

xRq 2 

bRq f 

xRq-t 

Ql 

0Rq z 

bRq A 

xRq 2 

Qz 

xRq 2 

bRq 6 

xRq-i 

Qa 

n. 

0Lq 4 

bRq x 

1 1 

Wr 

Qt 

— 

— 

— 


From the construction, it is apparent that the states are used to know 
whether the number of 0 ? s read is odd or even. 

We can see how M processes 0000. 

< 7 0 0000 |— bq i000 |— bxq 2 00 |— bxq 2 0 |— bx0xq 2 b 

j— bxOq^xb j— bxq^Oxb j— bq^xQxb |— q^bxOxb 
|— hq\xOxb |— bxq { 0xb |— bxxq 2 xb |— bxxxq 2 b 
|— bxxq^xb j— bxq^xxb j— bq+xxxb |— q A bxxxb 
|— bq { xxxb |— bxq { xxb |— bxxq x xb |— bxxxq x b 
j— bxxxbqq 

Hence M accepts vv. 

Also note that M always halts. If M reaches cjf, the input string w is 
accepted by M. If M reaches q r w is not accepted by M; in this case M halts 
in the trap state. 


EXAMPLE 9.17 


Let M = ({^o, q x q 2 }. {0. 1}, {0, 1, b}. 8, q 0 , {< 7 2 }) 
where 8 is given by 

%o 0) = (q h 1, R ) (Ri) 

1) = (<7o- R ) ( R ’) 

8(q u b ) = (q 2 , b, R) (R 3 ) 

Find T(M). 

Solution 

Let w € T(M). As <5(go 1) is not defined, w cannot start with 1. From (R t ) 
and (R 2 ), we can conclude that M starts from q 0 and comes back to q 0 after 
reaching 01. 

So, <7o(01)” (10)'‘<7o- Also, qqOb |— \q x b j— 1 bq z - 
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So, (01)"0 e TiM). Also, (01)"0 is the only string that makes M move from 
r/o to ch. Hence, TiM) = {(01)"0 | n >0}. 


SELF-TEST 

Choose the correct answer to Questions 1-10: 

1. For the standard TM: 

(a) I = r 

(b) r C x 

(c) x c r 

(d) X is a proper subset of F. 

2. In a standard TM. 8{q. a), q e Q. a € T is 

(a) defined for all ( q. a) e Q x F 

(b) defined for some, not necessarily for all (q, a) e Q x T 

(c) defined for no element (q. a) of Q x F 

(d) a set of triples with more than one element. 

3. If S(q. Xj) = ip. y. L), then 

(a) x x x 2 ■ ■ ■ X:_iqx, .. . x„ (— .v,.v 2 ... ... x n 

(b) XuX 2 . ■ ■ Xj_ x qx.; . . . X n ]- X x x 2 ■ ■ ■ X;_,XpX;+\ ... x n 

(c) x x x 2 . . . Xj^qXj . .. x„ j— x, - x i _ 3 px i _ 2 X;_ 1 yx i+l . . . x„ 

(d) . . . x H qx; . . . x„ |— .r-, . . . x i+1 pyx i+2 .. . x„ 

4. If 8(q. Xj) = ip. y. R). then 

(a) X X X 2 . . . -Y;_] qXj ... x„ x { x 2 . . ■ A-lJP-F+1 • • • x n 

(b) A'|.xy . . . x^qxj . .. x„ (— .Y].y 2 . . . xpx M ... x n 

(c) a-,.v 2 - Xj_ x qxi . . . x n x,x 2 . . . x,_]pXjXj + i .. . x„ 

(d) x t x 2 ... Xj_ { qxi . . . x n |— x { x 2 .. . Xi_ { ypx M ... x„ 

5. If 8(q. a'j) = (p, y. L). then 

(a) qx\x 2 ... x„ |— pyx 2 . . . x n 

(b) qx\x 2 . . . -V,, |— ypx 2 ... x n 

(c) qx\x 2 . . . x„ |— pbxj . . . x n 

(d) qx\x 2 . . . x n |— pbx 2 . . . x n 

6. If Siq. x„) = ip, y. R). then 

(a) *1 . . . x n _]qx n 1— pyx 2 x : . . . . x„ 

(b) a, . . . x n _ x qx n p- pyx 2 x 3 . . . x„ 

(c) .V] . . . x n _ x qx n |— .v..v 2 . .. x n _ x ypb 

(d) x x . . . x n _ x qx n p~ a 1 a 2 . . . x n _ x ypb 

7. For the TM given in Example 9.6: 

(a) bqjllbbl 

(b) quit1 1 j— bq,llbbl 

(c) qf)lbll |— \q 0 b\ll 

(d) 9o l*H (— q : Mlbbl 
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8. For the TM given in Example 9.4: 

(a) Oil is accepted by M 

(b) 001 is accepted by M 

(c) ' 00 is accepted by M 

(d) 0011 is accepted by M. 

9. For the TM given in Example 9.5: 

(a) 1 is accepted by M 

(b) 11 is accepted by M 

(c) 111 is accepted by M 

(d) 11111 is accepted by M 

10. In a standard TM (Q. I, F. 5. q Q , b, F) the blank symbol b is 

(a) in X - T 

(b) in T - X 

(c) T n X 

(d) none of these 


EXERCISES 

9.1 Draw the transition diagram of the Turing machine given in Table 9.1. 

9.2 Represent the transition function of the Turing machine given in 
Example 9.2 as a set of quintuples. 

9.3 Construct the computation sequence for the input Ibll for the Turing 
machine given in Example 9.5. 

9.4 Construct the computation sequence for strings 1213, 2133, 312 for the 
Turing machine given in Example 9.8, 

9.5 Explain how a Turing machine can be considered as a computer of integer 
functions (i.e. as one that can compute integer functions; we shall discuss 
more about this in Chapter 11). 

9.6 Design a Turing machine that converts a binary string into its equivalent 
unary string. 

9.7 Construct a Turing machine that enumerates {0 W 1"| n > 1}. 

9.8 Construct a Turing machine that can accept the set of all even 
palindromes over {0, 1}. 

9.9 Construct a Turing machine that can accept the strings over {0, 1} 
containing even number of l’s. 

9.10 Design a Turing machine to recognize the language { a n b Il c m \ n. m> 1}. 

9.11 Design a Turing machine that can compute proper subtraction, i.e. 
m - n . where m and n are positive integers, m - n is defined as m - n 
if m > n and 0 if m < n. 



Decidability and 
Recursively Enumerable 
Languages 


In this chapter the formal definition of an algorithm is given. The problem of 
decidability of various class of languages is discussed. The theorem on halting 
problem of Turing machine is proved. 

10.1 THE DEFINITION OF AN ALGORITHM 

In Section 4.4, we gave the definition of an algorithm as a procedure (finite 
sequence of instructions which can be mechanically carried out) that terminates 
after a finite number of steps for any input. The earliest algorithm one can think 
of is the Euclidean algorithm, for computing the greatest common divisor of 
two natural numbers. In 1900, the mathematician David Hilbert, in his famous 
address at the International congress of mathematicians in Paris, averred that 
every definite mathematical problem must be susceptible for an exact settlement 
either in the form of an exact answer or by the proof of the impossibility of its 
solution. He identified 23 mathematical problems as a challenge for future 
mathematicians; only ten of the problems have been solved so far. 

Hilbert's tenth problem was to devise ‘a process according to which it can 
be determined by a finite number of operations’, whether a polynomial over 
Z has an integral root. (He did not use the word 'algorithm: but he meant the 
same.) This was not answered until 1970. 

The formal definition of algorithm emerged after the works of Alan Turing 
and Alanzo Church in 1936. The Church-Turing thesis states that any 
algorithmic procedure that can be carried out by a human or a computer, can 
also be carried out by a Turing machine. Thus the Turing machine arose as 
an ideal theoretical model for an algorithm. The Turing machine provided a 
machinery to mathematicians for attacking the Hilberts' tenth problem. The 
problem can be restated as follows: does there exist a TM that can accept a 
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polynomial over n variables if it has an integral root and reject the polynomial 
if it does not have one. 

In 1970, Yuri Matijasevic. after studying the work of Martin Davis, Hilary 
Putnam and Julia Robinson showed that no such algorithm (Turing machine) 
exists for testing whether a polynomial over n variables has integral roots. Now 
it is universally accepted by computer scientists that Turing machine is a 
mathematical model of an algorithm. 

10.2 DECIDABILITY 

We are familiar with the recursive definition of a function or a set. We also 
have the definitions of recursively enumerable sets and recursive sets (refer to 
Section 4.4). The notion of a recursively enumerable set (or language) and a 
recursive set (or language) existed even before the dawn of computers. 

Now these terms are also defined using Turing machines. When a Turing 
machine reaches a final state, it ‘halts/ We can also say that a Turing machine 
M halts when M reaches a state q and a current symbol a to be scanned so 
that 8{q. a) is undefined. There are TMs that never halt on some inputs in any 
one of these ways. So we make a distinction between the languages accepted 
by a TM that halts on all input strings and a TM that never halts on some input 
strings. 

Definition 10.1 A language L c X* is recursively enumerable if there exists 
a TM M, such that L - T(M). 

Definition 10.2 A language L c I* is recursive if there exists some 
TM M that satisfies the following two conditions. 

(i) If vv e L then M accepts w (that is. reaches an accepting state on 
processing w) and halts. 

(ii) If iv g L then M eventually halts, without reaching an accepting state. 

Note: Definition 10.2 formalizes the notion of an ‘algorithm’. An algorithm, 
in the usual sense, is a well-defined sequence of steps that always terminates 
and produces an answer. The Conditions (i) and (ii) of Definition 10.2 assure 
us that the TM always halts, accepting w under Condition (i) and not accepting 
under Condition (ii). So a TM. defining a recursive language (Definition 10.2) 
always halts eventually just as an algorithm eventually terminates. 

A problem with only two answers Yes/No can be considered as a language 
L An instance of the problem with the answer ‘Yes' can be considered as an 
element of the corresponding language L; an instance with answer ‘No’ is 
considered as an element not in L. 

Definition 103 A problem with two answers (Yes/No) is decidable if the 
corresponding language is recursive. In this case, the language L is also called 
decidable. 
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Definition 10.4 A problem/language is undecidable if it is not decidable. 

Note: A decidable problem is called a solvable problem and an undecidable 
problem an unsolvable problem by some authors. 

10,3 DECIDABLE LANGUAGES 

In this section we consider the decidability of regular and context-free 
languages. 

First of all. we consider the problem of testing whether a deterministic 
finite automaton accepts a given input string w\ 

Definition 10.5 

A d p A = {{B, w) | B accepts the input string w} 

Theorem 10.1 A dfa is decidable. 

Proof To prove the theorem, we have to construct a TM that always halts 
and also accepts A dfa . We describe the TM M using high level description 
(refer to Section 9.5). Note that a DFM B always ends in some state of B after 
n transitions for an input string of length n . 

We define a TM M as follows: 

1. Let B be a DFA and w an input string. {B, w) is an input for the Turing 
machine M. 

2. Simulate B and input vr in the TM M. 

3. If the simulation ends in an accepting state of B . then M accepts w. 
If it ends in a nonaccepting state of B , then M rejects w. 

We can discuss a few implementation details regarding steps 1. 2 and 3 
above. The input (F. w) for M is represented by representing the five 
components Q, X. <5, f by strings of X* and input string w e X*. M checks 
whether (2T w) is a valid input. If not, it rejectes (B, w) and halts. If (B, w) 
is a valid input. M writes the initial state q§ and the leftmost input symbol of 
w. It updates the state using S and then reads the next symbol in w. This 
explains step 2. 

If the simulation ends in an accepting state w, then M accepts (B, w).. 
Otherwise, M rejects (B , w). This is the description of step 3. 

It is evident that M accepts (B, w) if and only if h- is accepted by the 
DFA B. I 

Definition 10.6 

A C fg = {(G, w) j the context-free grammar G accepts the input string w} 
Theorem 10.2 A CFG is decidable. 

Proof We convert a CFG into Chomsky normal form. Then any derivation 
of w of length k requires 2 k - 1 steps if the grammar is in CNF (refer 
to Example 6.18). So for checking whether the input string w of length k is 
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in L(G), it is enough to check derivations in 2k - 1 steps. We know that there 
are only finitely many derivations in 2k - 1 steps. Now we design a TM M 
that halts as follows. 

1. Let G be a CFG in Chomsky normal form and w an input string. 
(G, w) is an input for M. 

2. If k - 0. list all the single-step derivations. If k ^ 0, list all the 
derivations with 2k - 1 steps. 

3. If any of the derivations in step 2 generates the given string vr, M 
accepts ( G, tv). Otherwise M rejects. 

The implementation of steps 1-3 is similar to the steps in Theorem 10.1. 
{G, vc) is represented by representing the four components V N , X, P, S of G 
and input string vc. The next step of the derivation is got by the production 
to be applied. 

M accepts (G, vr) if and only if vc is accepted by the CFG G. 

In Theorem 4.3, we proved that a context-sensitive language is recursive. 
The main idea of the proof of Theorem 4.3 was to construct a sequence 
{Wo, W\, . • WjJ of subsets of (V N u X)*, that terminates after a finite 
number of iterations. The given string w e X* is in L(G ) if and only if vc € 
W k , With this idea in mind we can prove the decidability of the context- 
sensitive language. I 

Definition 10.7 A CSG = {(G, vc) | the context-sensitive grammar G accepts 
the input string vr}. 

Theroem 10.3 A C sg i s decidable. 

Proof The proof is a modification of the proof of Theorem 10.2. In 
Theorem 10.2, we considered derivations with 2k - 1 steps for testing whether 
an input string of length k was in L(G). In the case of context-sensitive 

grammar we construct W) = {a e (V v u X)* | S => oc in i or fewer steps and 

G 

\a\ < n}. There exists a natural number k such that W k = W M - W k+2 = . . . 
(refer to proof of Theorem 4.3). 

So w € L(G) if and only if w € W k . The construction of W k is the key 
idea used in the construction of a TM accepting A CSG . Now we can design a 
Turing machine M as follows: 

1. Let G be a context-sensitive grammar and w an input string of length 
n. Then (G, w) is an input for TM. 

2. Construct W 0 = {5}. W i+{ = W t u {/3 g (V v u X)" | there exists 
a { e W i such that a => /3 and |/3| < n }. Continue until W k = W* +J 
for some k. (This is possible by Theorem 4.3.) 

3. If w e W h vc € L(G) and M accepts (G, w); otherwise M rejects 
(G, vv). I 

Note: If denotes the class of all decidable languages over X, then 

Ai c i-cfl c ^csi c £ d 
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10.4 UNDECIDABLE LANGUAGES 

In this section we prove the existence of languages that are not recursively 
enumerable and address the undecidability of recursively enumerable 
languages. 

Theorem 10.4 There exists a language over X that is not recursively 
enumerable. 

Proof A language L is recursively enumerable if there exists a TM M such 
that L = T(M). As X is finite, X* is countable (that is, there exists a one-to- 
one correspondence between X* and N). 

As a Turing machine M is a 7-tuple (Q, X, F, 8 y q (h b, F) and each 
member of the 7-tuple is a finite set, M can be encoded as a string. So the 
set I of ail TMs is countable. 

Let £ be the set of all languages over X. Then a member of £ is a subset 
of X* (Note that X* is infinite even though X is finite). We show that £ is 
uncountable (that is, an infinite set not in one-to correspondence with N). 

We prove this by contradiction. If £ were countable then £ can be 
written as a sequence {L b Ly Ly . . We write X* as a sequence {w h w 2 , 
w ? . . . So £ can be represented as an infinite binary sequence x n x j2 x i?! . . . 
where 

fl if w i e L: 

*ij = O , J . 

^0 otherwise 

Using this representation we write £ as an infinite binary sequence. 

L x : x u x l2 x l3 ■ • • -L; • • • 

£ : ^2 1 V 22 X 2 3 . . . X2j • • ■ 

Lj . -X^X/2^/3 ■ • ■ Xjj • . . 

Fig. 10.1 Representation of £. 

We define a subset L of X* by the binary sequence , . . where >7 = 

1 - Xft. If Xj; = 0, }'j = I and if x h - = 1. y. ; = 0. Thus according to our assumption 
the subset L of X* represented by the infinite binary sequence y L y : y3 . . . 
should be L k for some natural number k. But L ^ L kf since w k e L if and only 
if w k £ L h This contradicts our assumption that £ is countable. Therefore £ 
is uncountable. As 1 is countable. £ should have some members not 
corresponding to any TM in L This proves the existence of a language over 
X mat is not recursively enumerable. I 

Definition 10.8 A TM - {(Af, \v) j The TM M accepts vv}. 
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Theorem 10.5 A TM is undecidable. 

Proof We can prove that A TM is recursively enumerable. Construct a TM IJ 
as follows: 

(M, vr) is an input to U. Simulate Ml on vv. If M enters an accepting state, 
U accepts (M, w). Hence A TM is recursively enumerable. We prove that A TM 
is undecidable by contradiction. We assume that At m is decidable by a TM H 
that eventually halts on all inputs. Then 


H(M, w) = 


f accept if M accepts w 
[reject if M does not accept w 


We construct a new TM D with H as subroutine. D calls H to determine 
what M does when it receives the input (M). the encoded description of M as 
a string. Based on the received information on (M, (M)), D rejects M if M 
accepts (M) and accepts M if M rejects (M). D is described as follows: 

1. (M) is an input to A where (M) is the encoded string representing M. 

2. D calls H to run on (M, (M)) 

3. D rejects (. M) if H accepts (M, (M)) and accepts ( M) if H rejects 
(M, (M)). 

Now step 3 can be described as follows: 


[accept if M does not accept (M) 

D((M)) = 

[reject if M accepts (M) 

Let us look at the action of D on the input (D). According to the 
construction of A 


A«D» 


J accept if D does not accept (D) 
[reject if D accepts (D) 


This means D accepts \D) if D does not accept (D), which is a 
contradiction. Hence ATM is undecidable. I 

The Turing machine U used in the proof of Theorem 10.5 is called the 
universal Turing machine . U is called universal since it is simulating any other 
Turing machine. 


10,5 HALTING PROBLEM OF TURING MACHINE 

In this section we introduce the reduction technique. This technique is used to 
prove the undecidability of halting problem of Turing machine. 

We say that problem A is reducible to problem B if a solution to problem 
B can be used to solve problem A. 

For example, if A is the problem of finding some root of x 4 - 3x 2 + 2 = 0 
and B is the problem of finding some root of v" -2=0, then A is reducible 
to B . As x~ - 2 is a factor of x*- - 3.v : + 2. a root of v 2 - 2 = 0 is also a root 
of A - 4 - 3.r + 2 = 0. 
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Note: If A is reducible to B and B is decidable then A is decidable. If A is 
reducible to B and A is undecidable, then B is undecidable. 

Theorem 10.6 HALT m = {(M, w) | The Turing machine M halts on input 
w} is undecidable. 

Proof We assume that HALT ' TM is decidable, and get a contradiction. Let M\ 

be the TM such that T(Mf) - HALT m and let M x halt eventually on all 

(M, w). We construct a TM M 2 as follows: 

1. For M 2 , (M, w) is an input. 

2. The TM M x acts on (M, w). 

3. If M } rejects (M, w) then M 2 rejects (M, w). 

4. If M x accepts (M, w), simulate the TM M on the input string w until 

M halts. 

5. If M has accepted w, M 2 accepts (M, w); otherwise M 2 rejects (Af, w). 

When Mi accepts (M, w) (in step 4), the Turing machine M halts on w. 
In this case either an accepting state ^ or a state q such that 8{q\ a) is 
undefined till some symbol a in w is reached. In the first case (the first 
alternative of step 5) M : accepts (M, w). In the second case (the second 
alternative of step 5) M 2 rejects (M, w). 

It follows from the definition of M 2 that M 2 halts eventually. 

Also, T(M 2 ) {{M, w) | The Turing machine accepts w} 

= ^TM 

This is a contradiction since A TM is undecidable. I 

10.6 THE POST CORRESPONDENCE PROBLEM 

The Post Correspondence Problem (PCP) was first introduced by Emil Post 
in 1946. Later, the problem was found to have many applications in the theory 
of formal languages. The problem over an alphabet L belongs to a class of 
yes/no problems and is stated as follows: Consider the two lists x - (x\ ... x „), 
y = (yi .. . y„) of nonempty strings over an alphabet 2 = {0, 1}. The PCP 
is to determine whether or not there exist i x , . . ., i m , where 1 < ij < n, such 
that 

A'jj . . . Xj ... }i m 

Note: The indices if s need not be distinct and m may be greater than n. 
x41so, if there exists a solution to PCP, there exist infinitely many solutions. 



Does the PCP with two lists jc = ( b, bab 3 , bd) and y = (b 3 , ba, a ) have a 
solution? 
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Solution 

We have to determine whether or not there exists a sequence of substrings of 
x such that the string formed by this sequence and the string formed by the 
sequence of corresponding substrings of y are identical. The required sequence 
is given by q = 2, z 2 = 1, Z 3 = 1, z ‘4 = 3, i.e. (2, 1, 1.3), and m = 4. The 
corresponding strings are 


baV 


b 


b 


ba 

= 

ba 


b 3 


b 3 


M 

x 2 


X\ 

*3 


>’2 

>’1 

y \ 

>’3 


Thus the PCP has a solution. 


EXAMPLE 10.2 


Prove that PCP with two lists * = (01, 1, 1), y = (Oi 2 , 10, l 1 ) has no solution. 

Solution 

For each substring jq € x and y,- e y, we have | | < | y,- | for all i. Hence 

the string generated by a sequence of substrings of x is shorter than the string 
generated by the sequence of corresponding substrings of y. Therefore, the PCP 
has no solution. 

Note: If the first substring used in PCP is always x\ and y h then the PCP 
is known as the Modified Post Correspondence Problem . 


EXAMPLE 10.3 


Explain how a Post Correspondence Problem can be treated as a game of 
dominoes. 

Solution 

The PCP may be thought of as a game of dominoes in the following way: Let 
each domino contain some x; in the upper-half, and the corresponding 
substring of y in the lower-half. A typical domino is shown as 

jq upper-half 

)’/ lower-half 


The PCP is equivalent to placing the dominoes one after another as a 
sequence (of course repetitions are allowed). To win the game, the same string 
should appear in the upper-half and in the lower-half. So winning the game 
is equivalent to a solution of the PCP. 
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We state the following theorem by Emil Post without proof. 

Theorem 10.7 The PCP over X for |E| > 2 is unsolvable. 

It is possible to reduce the PCP to many classes of two outputs 
(yes/no) problems in formal language theory. The following results can be 
proved by the reduction technique applied to PCP. 

1. If L x and L 2 are any two context-free languages (type 2) over an 
alphabet 1 and |E| > 2, there is no algorithm to determine whether or 
not 

(a) L, n L 2 = 0, 

(b) L, n L 2 is a context-free language, 

(c) L x c L 2 , and 

(d) L\ = L 2 . 

2. If G is a context-sensitive grammar (type 1), there is no algorithm to 
determine whether or not 

(a) L(G ) = 0, 

(b) L(G ) is infinite, and 

(c) x 0 e L(G) for a fixed string xq . 

3. If G is a type 0 grammar, there is no algorithm to determine whether 
or not any string x e X* is in L(G). 

10.7 SUPPLEMENTARY EXAMPLES 


EXAMPLE 10.4 


If L is a recursive language over X, show that L (L is defined as X* - L) is 
also recursive. 

Solution 

As L is recursive, there is a Turing machine M that halts and T(M) = L. We 
have to construct a TM M\, such that T(M { ) = L and M { eventually halts. 
M x is obtained by modifying M as follows: 

1. Accepting states of M are made nonaccepting states of M\. 

2. Let M { have a new state qp After reaching qp does not move in 
further transitions. 

3. If q is a nonaccepting state of M and 8(q, x) is not defined, add a 
transition from q to qf for M\. 

As M halts, Mi also halts. (If M reaches an accepting state on w, then M\ 
dees not accept w and halts and conversely.) 

Also Mi accepts w if and only if M does not accept w. So / is recursive. 
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EXAMPLE 10.5 


If L and L are both recursively enumerable, show that L and L are recursive. 

Solution 

Let and M 2 be two TMs such that L = T(M{) and L = T(M 2 ). We construct 
a new two-tape TM M that simulates M. } on one tape and M 2 on the other. 

If the input string w of M is in L, then M { accepts w and we declare that 
M accepts u\ If w e L, then M 2 accepts w and we declare that M halts without 
accepting. Thus in both cases, M. eventually halts. By the construction of M 
it is clear that T(M) = T{M x ) = L Hence L is recursive. We can show that 
L is recursive, either by applying Example 10.4 or by interchanging the roles 
of M x and M 2 in defining acceptance by M. 


EXAMPLE 10.6 


Show that A TM is not recursively enumerable. 

Solution 

We have already seen that A TM is recursively enumerable (by Theorem 10.5). 
If A tm were also recursively enumerable, then A TM is recursive (by 
Example 10.5). This is a contradiction since A TM is not recursive by 
Theorem 10.5. Hence A TM is not recursively enumerable. 


EXAMPLE 10.7 


Show that the union of two recursively enumerable languages is recursively 
enumerable and the union of two recursive languages is recursive. 

Solution 

Let L { and L 2 be two recursive languages and M\, M 2 be the corresponding 
TMs that halt. We design a TM M as a two-tape TM as follows: 

1. w is an input string to M. 

2. M copies w on its second tape. 

3. M simulates M x on the first tape. If w is accepted by M h then M 
accepts H-. 

4. M simulates M 2 on the second tape. If w is accepted by Af 2 , then M 
accepts w. 

M always halts for any input w. 

Thus Lj u L 2 = T(M) and hence Lj u L 2 is recursive. 

If Lj and L 2 are recursively enumerable, then the same conclusion gives 
a proof for L x u L 2 to be recursively enumerable. As M x and M 2 need not 
halt, M need not halt. 
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SELF-TEST 

1. What is the difference between a recursive language and a recursively 
enumerable language? 

2. The DFA M is given by 

M = ({go, <72, <73 h (0, 1}, 5 <7o, {<7ol) 

where 5 is defined by the transition Table 10.1. 


TABLE 10.1 Transition Table for Self-Test 2 


State 

0 

1 


Qz 

Qi 


<?3 

Qg 

<?2 

Qo 

Qs 

Qs 

Qi 

Qz 


Answer the following: 

(a) Is (M, 001101) in A dfa ? 

(b) Is (M, 01010101) in A dfa ? 

(c) Does M e A dfa ? 

(d) Find w such that (M, w) £ A dfa . 

3. What do you mean by saying that the halting problem of TM is 
undecidable? 

4. Describe A dfa? A cfg? A csg . A tm? and HALT tm . 

5. Give one language from each of £ xb ^ csl . 

6. Give a language 

(a) which is in T csj but not in i' rl 

(b) which is in T: cfl but not in x csl 

(c) which is in £ but not in ^ r] . 

EXERCISES 

10.1 Describe the Euclid’s algorithm for finding the greatest common 
divisor of two natural numbers. 

10.2 Show that A ndfa = { (B, w) \ B is an 7V DFA and B accepts w} is 
decidable. 

10.3 Show that £ DFA = {M\M is a D FA and T(M) = 0} is decidable. 

10.4 Show that £<2 dfa = {(A, 5)|A and B are DFAs and T(A) = r(£)} is 

decidable 

10.5 Show that £ CFG is decidable (£ C fg is defined in a way similar to that 
of £ D fa)* 
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10.6 Give an example of a language that is not recursive but recursively 
enumerable. 

10.7 Do there exist languages that are not recursively enumerable? 

10.8 Let L be a language over Z. Show that only one of the following are 
possible for L and L. 

(a) Both L and L are recursive. 

(b) Neither L nor L is recursive. 

(c) L is recursively enumerable but L is not. 

(d) L is recursively enumerable but L is not. 

10.9 What is the difference between A TM and HALT ru l 

10.10 Show that the set of all real numbers between 0 and 1 is uncountable. 
(A set S is uncountable if S is infinite and there is no one-to-one 
correspondence between S and the set of all natural numbers.) 

10.11 Why should one study undecidability? 

10.12 Prove that the recursiveness problem of type 0 grammar is unsolvable. 

10.13 Prove that there exists a Turing machine M for which the halting 
problem is unsolvable. 

10.14 Show that there exists a Turing machine M over {0, 1} and a state q m 
such that there is no algorithm to determine whether or not M will enter 
the state q m when it begins with a given ID. 

10.15 Prove that the problem of determining whether or not a TM over {0,1} 
will ever print the symbol 1, with a given tape configuration, is 
unsolvable. 

10.16 (a) Show that {x | * is a set and a* £ a} is not a set. (Note that this 

seems to be well-defined. This is one version of Russell’s paradox.) 
(b) A village barber shaves those who do not shave themselves but no 
others. Can he achieve his goal? For example, who is to shave the 
barber? (This is a popular version of Russell’s paradox.) 

Hints: (a) Let S = {a | a be a set and a € a}. If S were a set, then S e S or 

S £ S. If S £ S by the ‘definition’ of S, then S e S. On the other 

hand, if S e S by the ‘definition’ of S, then S £ S, Thus we can 

neither assert that S £ S nor S e S. (This is Russell’s paradox.) 

Therefore, S is not a set. 

(b) Let S = {a | a be a person and a does not shave himself}. Let b 
denote the barber. Examine whether b e S. (The argument is 
similar to that given for (a).) It will be instructive to read the proof 
of HP of Turing machines and this example, in order to grasp the 
similarity. 

10.17 Comment on the following: “We have developed an algorithm so 
complicated that no Turing machine can be constructed to execute the 
algorithm no matter how much (tape) space and time is allowed.” 
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10.18 Prove that PCP is solvable if | 11 = 1. 

10.19 Let x = (x { .. . x n ) and y = (y { . .. y n ) be two lists of nonempty strings 
over I and | X | > 2. (i) Is PCP solvable for n = 1? (ii) Is PCP solvable 
for n = 2? 

10.20 Prove that the PCP with {(01, Oil), (1, 10), (1, 11)} has no solution. 
(Here, = 01, x 2 = 1, x 3 ~ 1, y t - Oil, y 2 = 10, y 3 = 11.) 

10.21 Show that the PCP with S = {(0, 10), (1 2 0, 0 ? ), (0 2 1, 10)} has no 
solution. [Hint: No pair has common nonempty initial substring.] 

10.22 Does the PCP with x = (b : \ ab 2 ) and y = (fe 3 , bob 3 ) have a solution? 

10.23 Find at least three solutions to PCP defined by the dominoes: 


1 


10 


10111 

111 


0 


10 


10.24 (a) Can you simulate a Turing machine on a general-purpose 
computer? Explain. 

(b) Can you simulate a general-purpose computer on a Turing 
machine? Explain. 







Computability 


In this chapter we shall discuss the class of primitive recursive functions—a 
subclass of partial recursive functions. The Turing machine is viewed as a 
mathematical model of a partial recursive function. 


11.1 INTRODUCTION AND BASIC CONCEPTS 

In Chapters 5, 7 and 9, we considered automata as the accepting devices. In 
this chapter we will study automata as the computing machines. The problem 
of finding out whether a given problem is ‘solvable’ by automata reduces to 
the evaluation of functions on the set of natural numbers or a given alphabet 
by mechanical means. 

We start with the definition of partial and total functions. 

A partial function / from X to Y is a rule which assigns to every element 
of X at most one element of Y. 

A total function from X to Y is a rule which assigns to every element of 
X a unique element of F. For example, if R denotes the set of all real numbers, 
the rule / from R to itself given by f(r) = +Vr is a partial function since/(r) 
is not defined as a real number when r is negative. But g(r) = 2 r is a total 
function from R to itself. (Note that all the functions considered in the earlier 
chapters were total functions.) 

In this chapter we consider total functions from X k to X, where 
X = {0, 1. 2. 3, ...} or X = { a , £>}*. Throughout this chapter we denote 
(0, 1, 2, . . .) by N and (a, b) by X. (Recall that X k is the set of all ^-tuples 
of elements of X.) For example. flrn, n) = m - n defines a partial function 
from N to itself as /(m, n) is not defined when m - n < 0; g(m, n) = m + n 
defines a total function from N to itself. 
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Remark A partial or total function / from X k to X is also called a function 
of k variables and denoted by /( x x , x 2 , . ■ x k ). For example* f(x h x 2 ) = 
2x { + x 2 is a function of two variables: /(L 2) = 4, 1 and 2 are called 
arguments and 4 is called a value. g(w u w 2 ) = w t h’ 2 is a function of two 
variables (}X\W 2 e X*); g(cib, aa) ~ abaci, ab , aa are called arguments and 
is a value. 

11.2 PRIMITIVE RECURSIVE FUNCTIONS 

In this section we construct primitive recursive functions over N and E. We 
define some initial functions and declare them as primitive recursive functions. 
By applying certain operations on the primitive recursive functions obtained so 
far. we get the class of primitive recursive functions. 


11.2.1 Initial Functions 

The initial functions over N are given in Table 11.1. In particular, 

5(4) = 5. Z(7) = 0 

(/f(2. 4, 7) = 4. t/ft2, 4, 7) = 2. U?( 2, 4, 7) = 7 

TABLE 11.1 Initial Functions Over N 

Zero function Z defined by Z(x) = 0. 

Successor function S defined by S(.v) = x + 1. 

Projection function UP defined by UP(x 1f . .., x n ) = .v,-. 


Note: As [//(x) = x for every x in A\ is simply the identity function. So 
Uf is also termed a generalized identity function. 

The initial functions over X are given in Table 11.2. In particular, 

nil ( abab ) = A 

cons a(abab ) = aabab 

cons b{abab) - babab 

Note: We note that cons a(x) and cons b(x) simply denote the concatenation 
of the 'constant’ string a and and the concatenation of the constant string 
b and x. 


TABLE 11.2 Initial Functions Over {a, b} 


nil (,v) = A 
cons a(x) = ax 
cons b(x) = bx 
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In the following definition, we introduce an operation on functions over X. 

Definition 11.1 If/ 1 ,/ 2 , ■ ■•,/* are partial functions of n variables and g is 

a partial function of k variables, then the composition of g with / ls / 2 , . . .,/* 
is a partial function of n variables defined by 

g(fl(x U Xy, . . X n ), / 2 O 1 , X 2 , ■ ■ ; x„), . . f k (x U X 2 , • . x n )) 

If, for example, /, f 2 and / 3 are partial functions of two variables and g 
is a partial function of three variables, then the composition of g with /^ / 2 , 
h is given by g(fi(x u x 2 ), f 2 (x ,, x 2 ), / 3 (x h x 2 )). 


EXAMPLE 11.1 


Let /j(x, y) = x + y, / 2 (x, y) = 2x, / 3 (x, y) - at and g(x, y, z) = x + y + z be 
functions over N. Then 

g(f\(x- >'), f 2 (x, y), fi(x, y)) = g(x + y, 2x, xy) 

— x + y 4* 2a + xy 

Thus the composition of g with /, / 2 , / 3 is given by a function h: 
h(x , y) = x + y + 2 x + xy 

Note: Definition 11.1 generalizes the composition of two functions. The 
concept is useful where a number of outputs become the inputs for a subsequent 
step of a program. 

The composition of g with /, .. .,f n is total when g,/i,/ 2 , *. -,f n are total. 
The function given in Example 11.1 is total as /, / 2 , / 3 and g are total. 


EXAMPLE 11.2 


Let /1 (x, y) = x - y, / 2 (x, y) = y - x and g(x, y) = x + y be functions over 
N. The function f\ is defined only when x > y and / 2 is defined only when 
y > x. So fi and / 2 are defined only wfien x = y. Hence when x = y, 

g(fi(x. y), f 2 (x, y)) = g(x - X, x - x) = g( 0, 0) = 0 

Thus the composition of g with f { and / 2 is defined only for (x, x), where 
X € N. 


EXAMPLE 11.3 


Let /i(xj, x 2 ) = xjx 2 , f 2 (x h x 2 ) = A, f 2 (x\ t x 2 ) = x\, and g(x\, x 2 , x 3 ) = x 2 x 3 
be functions over E. Then 

g(/l(*b V;), / 2 (x b X 2 ), f 2 {X\, .V 2 )) = g(x 1 X 2 , A, JCO = Ax t = Xj 

So the composition of g with / b f 2 , f 2 is given by a function h, where 
h(x i, x 2 ) = x b 
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The next definition gives a mechanical process of computing a function. 

Definition 11.2 A function fix) over N is defined by recursion if there exists 
a constant k (a natural number) and a function h(x , y) such that 

m = h An + 1) = h(n,An)) (11.1) 

By induction on n , we can define /(ft) for all n. As / (0) = A, there is basis 
for induction. Once/(ft) is known,/(ft 4- 1) can be evaluated by using (11.1). 


EXAMPLE 11.4 


Define ft! by recursion. 

Solution 

AO) = 1 and /(ft + 1) = h(n , /(ft)), where h(x, y) = S(x) * y. 

The above definition can be generalized for f(x h x 2 , . . x„, x n+1 ). We 

fix ft variables in /(x b x 2 , ..x n+ j), say. xj, x 2 , .. x„. We apply Definition 

11.2 to/(x 1; x 2 , . . x„, y). In place of k we get a function g(x h x 2 , . . x n ) 

and in place of /z(x, y), we obtain h(x h x : , . . x„, y, f(x h .. x !V y)). 

Definition 11.3 A function / of ft 4 1 variables is defined by recursion if 

there exists a function g of n variables, and a function h of ft 4 2 variables, 
and / is defined as follows: 

f(x h x 2 , . . x H , 0 ) = g(x h x 2 , . . .. x ;z ) ( 11 . 2 ) 

/(Xi . . . x H , y 4 1) = /z(x b x 2 , . . ., x H , y, f(x h x 2 , . . x ;z , y)) (11.3) 

We may note that/can be evaluated for all arguments (x b x 2 , . . x n , y) 
by induction on y for fixed x b x 2 , .. x„. The process is repeated for every 

Xb *2- • • •• x,r 

Now we can define the primitive recursive functions over N. 

11.2.2 Primitive Recursive Functions Over N 

Definition 11.4 A total function / over N is called primitive recursive 
(i) if it is any one of the three initial functions, or (ii) if it can be obtained 
by applying composition and recursion a finite number of times to the set of 
initial functions. 


EXAMPLE 11.5 


Show that the function /(x, y) = x 4 y is primitive recursive. 

Solution 

fi is a function of two variables. If we want/i to be defined by recursion, we 
need a function g of a single variable and a function h of three variables. 

/t(x, 0) = x 4 0 = x 
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By comparing/i(x, 0) with L.H.S. of (11.2), we see that g can be defined by 

g (x) = X = U j(x) 

Also. fi(x, y + 1) = x + (y + 1) = (x + y) + 1 = /)(*, y) + 1 
By comparing /j(_x, y + 1) with L.H.S. of (11.3), we have 

h(x, y, fix, v)) = fi(x, y) + 1 = Si fix, >>)) = S(Ui(x, y, fix, >■))) 

Define /z(x y, z) = S(Uf(x, y, z)). As g = [//, it is an initial function. The 
function h is obtained from the initial functions Uf and 5 by composition, and 
by recursion using g and h. Thus f x is obtained by applying composition and 
recursion a finite number of times to initial functions [//. U ’3 and S. So/ } is 
primitive recursive. 

Note: A total function is primitive recursive if it can be obtained by applying 
composition and recursion a finite number of times to primitive recursive 
functions /j, / 2 , . . ., f m . This is clear as each fj is obtained by applying 
composition and recursion a finite number of times to initial functions. 


EXAMPLE 11.6 


The function / 2 (a\ y) = x * y is primitive recursive. 

Solution 

As multiplication of two natural numbers is simply repeated addition, f 2 has 
to be primitive recursive. We prove this as follows: 

h(x. 0 ) = 0 , f 2 (x, y + 1 ) = * * (y + 1 ) = f 2 (x, y) + x 

i.e. f 2 (x, y + 1 ) = ,/i(/:( a-- >’), x). Comparing these with (11.2) and (11.3), we 
can write 

/ 2 (x 0) = Z(v) and / 2 (x v + 1) = f\(U 2 (x, y, / 2 (a, y)), U f(x, y, / 2 (x, y)))) 
By taking g - Z and h defined by 

h(x. y, z) = /i(t/ 3 ? (A y, z), t/iV, y, 2 )) 

we see that f 2 is defined by recursion. As g and h are primitive recursive, / 2 
is primitive recursive (by the above note). 


EXAMPLE 11.7 


Show that fix, v) = x y is a primitive recursive function. 

Solution 

We define 

fix, 0 ) = 1 

fix, y + 1) = ,r * f{x. y) 

= U{{x, y, fix, >■)) * Uiix, y, fix, y)) 
Therefore, fix. y) is primitive recursive. 
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EXAMPLE 118 


Show that the following functions are primitive recursive: 

(a) The predecessor function p(x) defined by 

p(x) ~ x - l if x & 0, p(x) = 0 if x = 0. 

(b) The proper subtraction function - defined by 

x - y = x - y if x > y and x y = 0 if x < y. 

(c) The absolute value function | | given by 

\x\ = x if x > 0, \x\ = -x if x < 0. 

(d) min (x, y), i.e. minimum of x and y. 

Solution 

(a) p{0 ) = 0 and p(y + 1) = Ui(y, p(y)) 

(b) x - 0 = x and x - (y + 1) = p(x - y) 

(c) | x - y | = (x - y) + (y - x) 

(d) min(x, y) = x - (x - y) 

The first function is defined by recursion using an initial function. So it is 
primitive recursive. 

The second function is defined by recursion using the primitive recursive 
function p and so it is primitive recursive. Similarly, the last two functions are 
primitive recursive. 


11.2.3 Primitive Recursive Functions Over { a , b } 

For constructing the primitive recursive function over { a , b }. the process is 
similar to that of function over N except for some minor modifications. It 
should be noted that A plays the role of 0 in (11.2) and ax or bx plays the role 
of y + 1 in (11.3). Recall that E denotes { a , b}. 

Definition 11.5 A function/(x) over X is defined by recursion if there exists 
a ‘constant* string vr e X* and functions h { (x, y) and h 2 (x, y) such that 

/(A) = w 

f(ax) - htfx, f(x)) 

f(bx) = h 2 (x, f(x)) 


(b : and h 2 may be functions in one variable.) 


(11.4) 

(11.5) 
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Definition 11.6 A function/^, x 2 , . . x n ) over £ is defined by recursion 
if there exist functions g(x h . . .. x n _ { ), h x {x x , . . x n + 1 )> h(xi, ■ ■ •, x n+l ), 
such that 

/(A, x 2 , . . x n ) = g(x 2 , • • x n ) (11*6) 

Aax\* • • - x n ) = hi(Xi, x 2 , . . x tv f(x h x 2 , * * x n )) (11.7) 
f(bx h x 2 , . . x n ) = /i 2 (^b * • •* Ap Ai)) 

(h\ and h 2 may be functions of m variables, where m < n + 1 .) 

Now we can define the class of primitive recursive functions over X. 

Definition 11.7 A total function/is primitive recursive (i) if it is any one 
of the three initial functions (given in Table 11.2), or (ii) if it can be obtained 
by applying composition and recursion a finite number of times to the initial 
functions. 

In Example 11.9 we give some primitive recursive functions over I. 

Note: As in the case of functions over h\ a total function over E is primitive 
recursive if it is obtained by applying composition and recursion a finite number 
of times to primitive recursive function / b f 2 , . . .. f n . 


EXAMPLE 11.9 


Show that the following functions are primitive recursive: 

(a) Constant functions a and b (i.e. a(x) = a , b(x) = b) 

(h) Identity function 

(c) Concatenation 

(d) Transpose 

(e) Head function (i.e. head ( a { a 2 • ■ a n ) = a{) 

(f) Tail function (i.e. tail (ajfl 2 . . . a n ) = a 2 . . a„) 

(g) The conditional function “if X[ & A. then x 2 else x 2 ” 

Solution 

(a) As a(x) = cons a (nil (x)). the function a(x) is the composition of the 
initial function cons a with the initial function nil and is hence 
primitive recursive. 

(b) Let us denote the identity function by id. Then, 

id(A) = A 
id(ox) = cons a(x) 
id (bx) - cons b(x) 

So id is defined by recursion using cons a and cons b. Therefore, the 
identity function is primitive recursive. 

(c) The concatenation function can be defined by 

concat(x b x 2 ) = x x x 2 
concat(A. x 2 ) = id(x 2 ) 
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concat(ar b x 2 ) = cons a (concatC^, x 2 )) 

concat(te 1? x 2 ) = cons b (concat(x 1? x 2 )) 

So concat is defined by recursion using id, cons a and cons b. 
Therefore, concat is primitive recursive, 

(d) The transpose function can be defined by trans(x) = x T . Then 

trans(A) = A 

trans(ax) = concat(trans(A). a(x)) 
trans(fe) = concat(trans(x), b(x)) 

Therefore, trans(x) is primitive recursive. 

(e) The head function head(;t) satisfies 

head(A) = A 
head(ox) = a(x) 
head(foc) = b(x) 

Therefore, headfx) is primitive recursive. 

(f) The tail function tail(v) satisfies 

tail(A) = A 
tail(ax) = id(x) 
tail(fcr) = idfx) 

Therefore, tail(jc) is primitive recursive. 

(g) The conditional function can be defined by 

cond(;q, x 2 , x 2 ) = “if x\ ^ A then x 2 else x^ 

Then, 

cond(A, x 2 , x 3 ) = id(x 3 ) 
cond(<xq, x 2 , x 2 ) = id(:v 2 ) 
cond(Z 7 X!, x 2 , x 2 ) = id(x 2 ) 

Therefore, idfo, x 2 , x$) is primitive recursive. 


11.3 RECURSIVE FUNCTIONS 

By introducing one more operation on functions, we define the class of 
recursive functions, which includes the class of primitive recursive functions. 

Definition 11.8 Let g(x h jc 2 . . . x m y) be a total function over N. g is a 
regular function if there exists some natural number y 0 such that g(xi, x 2 , . . ., 
x n . y 0 ) = 0 for all values x b x 2 , . . x n in A 7 . 
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For instance, g (x, y) = min(x, y) is a regular function since g(x, 0) = 0 
for all x in N. But /(x, y) = | x - y | is not regular since /(x, y) = 0 only when 
x = y, and so we cannot find a fixed y such that /(x, y) - 0 for all x in N. 

Definition 11.9 A function f(x h x 2 , ■.x 7? ) over N is defined from a total 
function g(x b x 2 * . . x n , y) by minimization if 

(a) fix h x 2? . . .. x n ) is the least value of all v’s such that g(x h x 2 ,.. x n , y) - 0 
if it exists. The least value is denoted by p y (g(x h x 2 , • . x m y) - 0). 

(b) fix h x 2 .. . .. x n ) is undefined if there is no y such that g(x h x 2 ... x n , y) = 0 . 

Note: In general / is partial But, if g is regular then / is total 

Definition 11.10 A function is recursive if it can be obtained from the initial 
functions by a finite number of applications of composition, recursion and 
minimization over regular functions. 

Definition 11.11 A function is partial recursive if it can be obtained from 
the initial functions by a finite number of applications of composition, 
recursion and minimization. 


EXAMPLE 11.10 


fix) = x/2 is a partial recursive function over N. 

Solution 

Let g(x, v) = 12y - x |, where 2y - x = 0 for some y only when x is even. 
Let fi(x) = t U y (\2x - x\ = 0). Then f x (x) is defined only for even values of x 
and is equal to x/2. When x is odd, fi(x) is not defined. f x is partial recursive. 
As fix) = x/2 = f\(x). f is a partial recursive function. 

The following example gives a recursive function which is not primitive 
recursive. 


EXAMPLE 11.11 


The Ackermann's function is defined by 

A(0, >•)=>• + 1 (11.8) 

A(x + 1, 0) = A(x, 1) (11.9) 

A(x + 1, v + 1) = A(x. A(x + L y)) (11.10) 

A(x. y) can be computed for every (x, y), and hence A(x, y) is total. 

The Ackermann’s function is not primitive recursive but recursive. 




EXAMPLE 11.12 


Compute A( 1. 1). A(2, 1), A(l, 2), Ad, -)■ 

Solution 


A(l, 1) = A(0 +1.0+1) 


o 

t —1 

o 

< 

II 

bv ( 11 . 10 ) 

= A(0, A(0. 1)) 

by (11.9) 

/-“s 

ci 

© 

II 

by ( 11 . 8 ) 

= 3 

by ( 11 . 8 ) 

Ad, 2) = A(0 + 1,1 + 1) 

= A(0, A(l, 1)) 

by ( 11 . 10 ) 

= A(0, 3) 

= 4 

by ( 11 - 8 ) 

A(2, 1) = A(1 + 1-0+ 1) 

= A(l, A(2, 0)) 

by ( 11 . 10 ) 

II 

by (11.9) 

= A(l, 3) 

= A(0 + 1. 2 + 1) 

= A(0. A(l, 2)) 

by ( 11 . 10 ) 

= A(0, 4) 

= 5 

A(2. 2) = A(1 + 1. 1 + 1) 

= A(l, A(2, 1)) 

by ( 11 . 10 ) 

= A(l, 5) 

A(l. 5) = A(0 + 1 . 4 + 1) 

= A(0, Ad, 4)) 

by ( 11 . 10 ) 

= 1 + Ad. 4) 

by ( 11 . 8 ) 

= 1 + A(0 + 1. 3 + 

1 ) 

= 1 + A(0. A(l, 3)) 
= 1 + 1 + A(l, 3) 

= 1 + 1 + 1+ A(l, 

2 ) = 1 + 1 + 1 


As A(2. 2) = A(l : 5). we have A(2, 2) - 7 
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So far we have dealt with recursive and partial recursive functions over 
N\ We can define partial recursive functions over Z using the primitive 
recursive predicates and the minimization process. As the process is similar, 
we will discuss it here. 

The concept of recursion occurs in some programming languages when a 
procedure has a call to the same procedure for a different parameter. Such a 
procedure is called a recursive procedure. Certain programming languages like 
C, C++ allow recursive procedures. 


11.4 PARTIAL RECURSIVE FUNCTIONS AND TURING 
MACHINES 

In this section we prove that partial recursive functions introduced in the earlier 
sections are Turing-computable. 


11.4.1 Computability 

In mid 1930s. mathematicians and logicians were trying to rigorously define 
computability and algorithms. In 1934 Kurt Godel pointed out that primitive 
recursive functions can be computed by a finite procedure (i.e. an algorithm). 
He also hypothesized that any function computable by a finite procedure can 
be specified by a recursive function. Around 1936, Turing and Church 
independently designed a 'computing machine' (later termed Turing machine) 
which can carry out a finite procedure. 

For formalizing computability, Turing assumed that, while computing, a 
person writes symbols on a one-dimensional paper (instead of a two- 
dimensional paper as is usually done) which can be viewed as a tape divided 
into cells. He scans the cells one at a time and usually performs one of the three 
simple operations, namely (i) writing a new symbol in the cell he is scanning, 
(ii) moving to the cell left of the present cell, and (iii) moving to the cell right 
of the present cell. These observations led Turing to propose a computing 
machine. The Turing machine model we have introduced in Chapter 9 is based 
on these three simple operations but with slight variations. In order to introduce 
computability, we consider the Turing machine model due to Post. In the 
present model the transition function is represented by a set of quadruples (i.e. 
4-tuples), whereas the transition function of the model we have introduced in 
Chapter 9 can be represented by a set of quintuples (5-tuples). For example, 
8{q r a) = (cjj. a , p) is represented by the quintuple qjaafiqj. Using the model 
specifying the transition function in terms of quadruples, we define Turing- 
computable functions and prove that partially recursive functions are Turing- 
computable. 
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11.4.2 A Turing Model for Computation 

As in the model introduced in Chapter 9, Q , q 0 and T denote the set of states, 
the initial state, and the set of tape symbols, respectively. The blank symbol b 
is in r. The only difference is in the transition function. In the present model 
the transition function represents only one of the following three basic 
operations: 

(i) Writing a new symbol in the cell scanned 

(ii) Moving to the left cell 

(iii) Moving to the right cell 

Each operation is followed by a change of state. Suppose the Turing machine 
M is in state q and scans a r If a f is written and M enters q\ then this basic 
operation is represented by the quadruple qaftfl'. Similarly, the other two 
operations are represented by the quadruples qa\Lc{ and qa { Rq'. Thus the 
transition function can be specified by a set P of quadruples. As in Chapter 9, 
we can define instantaneous descriptions, i.e. IDs. 

Each quadruple induces a change of IDs. For example, qafljq induces 

Oqa-.fi j— aq'cijfi 

The quadruple qcqLq' induces 

a \ a 2 • * • a i-\ c l a i * • ♦ a n\ - a \-\ a i * • • a n 

and qciiRq induces 

a\ . . . cij^qcii . . . a n |— . . . a i qa }+[ . . . ci n 

When we require M to perform some computation, we ‘feed 5 the input by 
initial tape expression denoted by X. So q$X is the initial ID for the given 
input. For computing with the given input X , the Turing machine processes 
X using appropriate quadruples in P. As a result, we have q$X = ID/ |— TD 2 
|— .... When an ID. say ID,,, is reached, which cannot be changed using any 
quadruple in P, M halts. In this case, ID ;? is called a terminal ID. Actually, 
ciqiCCp is a terminal ID if there is no quadruple starting with q^a. The terminal 
ID is called the result of X and denoted by Res(X). The computed value 
corresponding to input X can be obtained by deleting the state appearing in it 
as also some more symbols from Res(X). 


11.4.3 Turing-computable Functions 

Before developing the concept of Turing-computable functions, let us recall 
Example 9.6. The TM developed in Example 9.6 concatenates tw'o strings a 
ard fi. Initially, a and /3 appear on the input tape separated by a blank b . 
Finally, the concatenated string a/3 appears on the input tape. The same 
method can be adopted with slight modifications for computing f(x h . . x m ). 
Suppose we want to construct a TM wdich can compute/(ay, . . x m ) over 
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N for given arguments a h . . .. a m . Initially, the input a x , a 2 , . . a m appears 
on the input tape separated by markers x u . . x m . The computed value 
f(a x . . . .. a m ), say, c appears on the input tape , once the computation is over. 
To locate c we need another marker, say y. The value c appears to the right 
of x m and to the left of y. To make the construction simpler, we use the tally 
notation to represent the elements of N. In the tally notation, 0 is represented 
by a string of b's. A positive integer n is represented by a string consisting 
of n Vs. So the initial tape expression takes the form l cl Vx x l a2 x 2 . . . 1 Cim x m by. 
As a result of computation, the initial ID ^oTTxyFAw .. . lVc m by is changed 
to a terminal ED of the form l ai x x TAxs . . . l Ll}n x m Yqy for some q e Q. In 
fact, the position of q' in a teiminal ID is immaterial and it can appear 
anywhere in Res(X). The computed value is found between x m and y. 
Sometimes we may have to omit the leading b's. 

We say that a function f(x u . . x m ) is Turing-computable for arguments 
a x , . . .. ci m if there exists a Turing machine for which 

4 0 i t7 W t--ax ... i a '"x m by ]r- ro /; 

where XD„ is a terminal ID containing f(a x , . . ci m ) to the left of y. 

Our ultimate aim is to prove that partial recursive functions are Turing- 
computable. For this purpose, first of ail we prove that the three initial primitive 
recursive functions are Turing-computable. 


11.4.4 Construction of the Turing Machine That 
Can Compute the Zero Function Z 

The zero function Z is defined as Z(a x ) - 0 for all a { > 0. So the initial tape 
expression can be taken as X - l a Kx\by. As we require the computed value 
Z(a { ). namely 0. to appear to the left of y, we require the machine to halt 
without changing the input. (Note that 0 is represented by b in the tally 
notation.) 

Thus we define a TM by taking Q = {<7 0 , c/^}, T = {b , 1 , y}, 

X = l LI KX[by. P consists of q^bRci^ q^lRq^ qoX\x x qi. qobRqo and are 

used to move to the right until x x is encountered. q<yx x xiq\ enables the TM to 
enter the state q x . M enters q x without altering the tape symbol. In terms of 
change of IDs, we have 

q {) V\x x by l a iqoX x by |— l a iq x x { by 

As there is no quadmple starting with q x% M halts and Res(X) = l a k] x x x by. 
By deleting q x in Res(X), we get \ a ^x x by (which is the same as X) yielding 0 
(given by b). 

Note: We can also represent the quadruples in a tabular form which is 
similar to the transition table obtained in Chapter 9. In this case we have to 
specify (i) the new symbol written, or (ii) the movement to the left (denoted 
by L), or (iii) the movement to the right (denoted by R). So we get 
Table 11.3. 
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TABLE 11.3 Representation of Quadruples 


State 

b 1 x-\ 

y 

Vo 

(R. q 0 ) (R. q 0 ) (.v,, p,) 


<7i 




11.4.5 Construction of the Turing Machine for 
Computing—The Successor Function 

The successor function S is defined by S(a 4 ) = a x + 1 for all a x > 0. So the 
initial tape expression can be taken as X = l a lx { by (as in the case of the zero 
function). At the end of the computation, we require T' 1+1 to appear to the left 
of y. Hence we define a TM by taking 

Q = . . .. q 9 } t r = {K 1. as, y}. X = TAr, by 

where P consists of 

(i) q 0 bRq 0 , q C) lbq h q 0 x ] Rq 6 

(ii) qibRqi, q\lRq h q\X Y Rq h q^ylq, 

(iii) q 2 lRq 2 ^ qybyq^ 

(iv) q 3 bLq 3 . q 3 lLq 3 . q 3 yLq 3 . q%X\Lq 4 

(v) q 4 \Lq 4 . q 4 blq 5 , 

(vi) q 5 lRq 0 . 

(vii) q 6 bRq 6 , q 6 lRq 6 . q (r x x Rq 6 . q&Lcjj 
(viii) q 7 lLq 7 , q 7 blq& 

(ix) q s bLq^ q s \Lq s . q%yLcj^ q%x { x x q 9 . 

The corresponding operations can be explained as follows: 

(i) If M starts from the initial ID, the head replaces the first 1 it 
encounters by b. Afterwards the head moves to the right until it 
encounters y (as a result of q 0 lbq h q x bRq u q x \Rq x , q\X\Rq{). 

(ii) y is replaced by 1 and M enters q 2 . Once the end of the input tape is 
reached, y is added to the next cell M enters q 3 (qiylq 2 , 
q2byq 3 )- 

(iii) Then the head moves to the left and the state is not changed until x x 
is encountered ( q 3 yLq 3 , q 3 yLq 3 . q 3 bLq 3 ). 

(iv) On encountering ay. the head moves to the left and M enters q 4 . Once 
again the head moves to the left till the left end of the input string is 
reached (q 3 x x Lq 4 , q x lLq 4 ). 

(v) The leftmost blank (written in point (i)) is replaced by 1 and M enters 
q 5 (q 4 blq 5 ). 

Thus at the end of operations (i)-(v). the input part remains unaffected but 

the first 1 is added to the left of y. 
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(vi) Then the head scans the second 1 of the input string and moves right, 
and M enters q 0 (q 5 lRq 0 ). 

Operations (i)—(vi) are repeated until all the Ts of the input part 
(i.e. in Th) are exhausted and 11 ... 1 times) appear to the left 
of y. Now the present state is q 0 , and the current symbol is x \. 

(vii) M in state q 0 scans x h moves right, and enters q 6 . It continues to move 
to the right until it encounters y (q 0 x { Rq 6 , q 6 bRq& q 6 \Rq 6 , q^Rq^). 

(viii) On encountering v, the head moves to the left and M enters q 1 , after 
which the head moves to the left until it encounters b appearing to the 
left of l fl i of the output part. This b is changed to 1, and M enters 
q%(q 6 yLqn . q\\Lqi, ^Ms)- 

(ix) Once M is in g 8 , the head continues to move to the left and on scanning 
M enters q 9 . As there is no quadruple starting with r/ 9 , M halts 
0 q%bLq %. q%lLq<&, q%x { x x q 9 ). 

The machine halts, and the terminal ID is l u 'q 9 x x Y l Y x y. For example, let 
us compute S( 1). In this case the initial ED is q 0 Lx l by. As a result of the 
computation, w r e have the following moves: 

q 0 iX]by |— q\bx\by |— bq { x x by 
j— bx { q x by |— bx { bq x y |— bx x bq 2 l 
|— bx x b\q 2 b |— bx\b\qyy |— bx x bq 2 ly 
p- bqyx x bly |— q 4 bx { bly |— q 5 lx x bly 
h 1 - 4(^44 P- Ixyblq^y (— \.x-bq-,\y 
!— Lx^jjbly |— l.Vj^gilv |— 11)' 

f— 4^1 iiy 

Thus. M halts and 5(1) = 2 (given by 11 to the left of y). 

11.4.6 Construction of the Turing Machine for 
Computing the Projection L/f 1 

Recall .... a m ) = a t . The initial tape expression can be taken as 

X = ¥%¥%x 2 . . . 1 a »'x m by 

We define a Turing machine by taking Q = {q 0 , . . q%) 

T - {b, 1. .Vj, .... x mr y}. P consists of 
qozRqo for all z € Y - {*,-} 

qoXiLq u q\bbq g , qlbq 2 

q 2 zRq 2 for all z e Y - {v} 

qyylqy q 2 \Rq^ q^byq^ 
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q+zLq 4 for all z e F - {x ? } 

q 4 X;Lq^ q 5 lLq 5i q^blq^ r/ 6 lZx/ 7 . qilbq 2 

q~zRq% for all z <= F - { 1 } 

The operations of M are as follows: 

(i) M starts froin the initial ID and the head moves to the right until it 
encounters XjiqQzRqo)- 

(ii) On seeing x b the head moves to the left (q^Lq^. 

(iii) The head replaces 1 (the rightmost 1 in F'O by b {q^bqi). 

(iv) The head moves to the right until it encounters y and replaces y by 1 
(q 2 zRq 2 , z € F - {>■} and qyylq^). 

(v) On reaching the right end, the head scans b and replaces this b by 

y (q^byq 4 ). 

(Vi) The head moves to the left until it scans the symbol b . This b is 
replaced by 1 (q 4 zLq 4 , z e T - {x ; }, q±XjLq $, q 5 blq 6 ). 

(vii) The head moves to the left and one of the Fs in l a ' ; is replaced by 
b . M reaches q 2 (g 6 lLq 7 . q-lbq 2 ). 

As a result of (i)—(vii), one of the Fs in l a ‘ is replaced by b and 
1 is added to the left of y. Steps (iv)-(vii) are repeated for all l’s 
in FF 

(viii) On scanning x^. the head moves to the right and M enters 
q% (qjX;_[Rq%). 

As there are no quadruples starting with q^ the Turing machine M halts. 
When i & 1 and a } =£ 0 . the terminal ID is FTtj . . . x M g§FFq . . . x n b\ Ui y. 
For example, let us compute U 2 (L 2, 1): 

< 7 ol*i 1 F\x 1 x 3 by p- lxj 11 q 0 x 2 1 x 2 by 

|— Lx[lq\lx 2 hc 2 by j— lx[lq 2 bx 2 lx 2 by 

lxiibx 2 lx 2 bq 2 y }— lx\lbx 2 lx 2 bq 2 l 

|— lxilbx 2 lx 2 blq 2 b |— Ixq lbx 2 lx 2 blq 4 y 

p~ lxj Ibq^xAx^bly |— lx\ lq$bx 2 lx 2 bly 

|— \x\lq§\x 2 \x 2 bly I— Ixiq-jllxAx^bly 

|— Ixj^ 2 ^ 1 x 2 lx 3 ^ 1 v 

From the above derivation, we see that 

lx] 1 q 2 bx 2 lx ? by p- lx { q 2 b lx 2 1 x 2 b ly 

Repeating the above steps, we get 

\x { q 2 blx 2 lx 2 b\y p- Fqgglb^lxxMiv 

It should be noted that this construction is similar to that for the successor 
function. While computing U - n , the head skips the portion of the input 
corresponding to ci h j * i. For every 1 in FF 1 is added to the left of y. 
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Thus we have shown that the three initial primitive recursive functions are 
Turing-computable. Next we construct Turing machines that can perform 
composition, recursion, and minimization. 


11.4.7 Construction of the Turing Machine That 
Can Perform Composition 

Let /i(A'i. jc 2? . . .. x„j ), . . *,f k (x [f .. x m ) be Turing-computable functions. Let 
g(y i, . . .. ») be Turing-computable. Let h(x h . . x m ) - g(f\{x \, .. 

4(.T{, .. .. .v 7H )). We construct a Turing machine that can compute h(ci u . . ., a m ) 
for given arguments a 1? .... a m . This involves the following steps: 

Step 1 Construct Turing machines M h . . ., M k which can compute/ b . . 
respectively. For the TMs M h . . .. M h let F = {1, b, x h x 2 , . . .. x 7J , y} and 
X = .\\ ?i Zn\ But the number of states for these TMs will vary. 

Let 771 + 1.njt + 1 be the number of states for M h . . ., M h respectively. 

As usual, the initial state is q 0 and the states for are q 0 , . . ., q Uj . As in the 
earlier constructions, the set P; of quadruples for M, is constructed in such a 
way that there is no quadruple starting with q Uj . 

Step 2 Let f;(a b . . a w ) = fy for i = L 2, .... k. At the end of step 1, 
w'e have Mf s and the computed values b{ s. As g is Turing-computable, we 
can construct a TM M m which can compute g(£h, ..., For M* + ], 

T = {1. b. x\. . . .. x' m . y}, X' = ri.v', . . . lW m foy 

(We use different markers for M k+l so that the TM computing h to be 
constructed need not scan the inputs a\. . . a„ r ) Let n k+l + 1 be the number 
of states of M k+1 . As in the earlier constructions, M k+ i has no quadruples 
starting with q k+] . 

Step 3 At the end of step 2, w'e have TMs M i. .... M h which give 
b\. .... b m and g{b\. . . .. b k ) = c (say), respectively. So we are able to 
compute h(ai. . . ., a m ) using k + 1 Turing machines. Our objective is to 
construct a single TM M k+2 which can compute h(a h . . a m ). We outline the 
construction of M without giving the complete details of the encoding 
mechanism. For M. let 

r = {1- b. .V[. . . .. x„„ x\ . x' m y} 

X= r'i.x: i l"-.Y 2 . .. 1 a >"x m by 

(i) In the beginning, M simulates Mu As a result, the value b\ = 
.... a w ) is obtained as output. Thus we get the tape expression 
LApF-vy . . . V lm x m l bl y which is the same as that obtained by M x 
w ? hile halting. M does not halt but changes y to x\ and adds by to the 
right of x'j. The head moves to the left to reach the beginning of X. 
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(ii) The tape expression obtained at the end of (i) is 

V'x^x: . . . l Um x m l h Kx\by 

The construction given in (i) is repeated, i.e. M simulates M 2 . . . M h 
changes y to x h and adds by to the right of x- r After simulating M h 
the tape expression is 

X' = 1"1jc, ... l a "’x m l h lx\ ... 1 7, *->XV|1 h x' k by 

Then the head moves to the left until it is positioned at the cell having 
1 just to the right of x„ r 

(iii) M simulates M m . M k+l with initial tape expression X f halts with the 
tape expression . . . l hk x m Ty. As a result, the corresponding tape 
expression for M is obtained as 

. . . \ a ^x m l b Kx\ . . . 1 b tf k l r y 

(iv) The required value is obtained to the left of y. but . . . l l ' k x k also 
appears to the left of c. M erases all these symbols and moves Ty just 
to the right of x m . The head moves to the cell having x m and M halts. 
The final tape expression is TT^T-as . . . T 7m x„,Ty. 


11.4.8 Construction of the Turing Machine That 
Can Perform Recursion 

Let gCv,. x m ). • - ■* >W:) be Turing-computable. Let f(x h . . 

be defined by recursion as follows: 

f(x u . . .. 0) = g(x i . . . x m ) 

f{x\* ■ ■ •« x mf a + 1 ) — h (x i . .... x m , y , f{x\i • ■ •? Tj/? y)) 

For the Turing machine M, computing f(a h . . ., a m , c). (say k), X is 
taken as 

1^1 . . • l a -x m l% l+l by 

As the construction is similar to the construction for computing 
composition, we outline below the steps of the construction. 

Step 1 Let M simulate the Turing machine M' which computes g(a\, . . .. ci w ). 
The computed value, namely g(a h . . ., a m ). is placed to the left of y. If 
c ~ 0. then the computed value g{a h . . a m ) is /(c/j, . . .. a m , 0). The head 
is placed to the right of x m and M halts. 

Step 2 If c is not equal to zero, V to the left of v m+i is replaced by b r . The 
marker y is changed to x m+ 2 and by is added to the right of x m+2 - The head 
moves to the left of l" 1 . 

Step 3 h is computable. M is allowed to compute h for the arguments c/j, . , 
a nr 0. g{ci\. . . ., a m ) which appear to the left of x\* . . x nv x m+] , x m+2 , 
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respectively. The computed value is /(<7j, . . a nv 1). And f(a h . . ., a nv 2) 

. . . /(flj, . . ., a m , c) are computed successively by replacing the rightmost b 
and computing h for the respective arguments. 

The computation stops with a terminal ID. namely 

bl l \x jT- ... q f Yx n+] l k }\ k = f(a h . . ., a uv c) 

11A9 Construction of the Turing Machine That 
Can Perform Minimization 

When /(.v 1? . . x m ) is defined from g(x l . . . ., x nv y) by minimization, 
/(jcj. . . x m ) is the least of all k ’s such that g(x { , . . ., x m , k) = 0. So the 
problem reduces to computing g(a l¥ . . ., ci m , k) for given arguments 
ci\, . . a m and for values of k starting from 0. f(a b . . ., a m ) is the first k 
for which g(a u .... a m , k) = 0. Hence as soon as the computed value of 
g(a 1? . . .. a m ,y) is zero, the required Turing machine M has to halt. Of 
course, when no such y exists. M never halts, and f(a x , . . a m ) is not defined. 

Thus the construction of M is in such a way that it simulates the TM that 
computes g(a } . . . .. a f}r k) for successive values of k. Once the computed value 
g(a]. . . .. a m . k) = 0 for the first time, M erases by and changes x m+l to y. The 
head moves to the left of x m and M halts. 

As partial recursive functions are obtained from the initial functions by a 
finite number of applications of composition, recursion and minimization 
(Definition 11.11) by the various constructions we have made in this section, 
the partial recursive functions become Turing-computable. 

Using Godel numbering which converts operations of Turing machines 
into numeric quantities, it can be proved that Turing-computable functions are 
partial recursive. (For proof, refer Mendelson (1964).) 

11.5 SUPPLEMENTARY EXAMPLES 


EXAMPLE 11.13 


Show that the function f(x\, Xy ■ ■ .. x„) = 4 is primitive recursive. 

Solution 

4 = S 4 ( 0) 

= 5 4 (Z(.r,)) 

= S 4 (Z((y i "(.V l . A';. . . .. A,,))) 
i.e. 

fix 1 - A‘2,.V-„) = S 4 {Z(U"(x u a 2 , . . X,,))). 

As / is the composition of initial functions, / is primitive recursive. 
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EXAMPLE 11.14 


If f(x h x 2 ) is primitive recursive, show that g(x { . x 2 , x 3 , x 4 ) = f(x h x 4 ) is 
primitive recursive. 

Solution 

g(x u x 2 , x 3 , x 4 ) 

= f(*b * 4 ) 

= f(Ui(x u x 2 , V 3 , x 4 ). uf(x u X 2 , * 3 , * 4 )) 

U 4 and U 4 are initial functions and hence primitive recursive. / is primitive 
recursive. As the function g is obtained by applying composition to primitive 
recursive functions, g is primitive recursive (by the Note appearing at the end 
of Example 11.5). 


EXAMPLE 11.15 


If f(x, y) is primitive recursive, show that g(x, y) = f(4. y) is primitive 
recursive. 

Solution 

Let h(x. y) = 4. h is primitive recursive by Example 11.13. 

g(A y) 

= /(4 f v) 

= /(/z(a\ y). £/ 2 2 (x. y)) 

As / and g are primitive recursive and Uf is an initial function, g is 
primitive recursive. 


EXAMPLE 11.16 


Show that fix. y) = ,ry 4 + 7xy 3 + 4y~ is primitive recursive. 

Solution 

As fi(x. y) - x + y is primitive recursive (Example 9.5), it is enough to prove 
that each summand of fix. y) is primitive recursive. 

But, 

xyv 4 = Uf{x, y) * Uf(x, y) * Ufix, y) * Ufi(x. y) * £/ 2 2 (x, y) * f/ 2 2 (x. y) 

As multiplication is primitive recursive. g(x, y) = x 2 y 4 is primitive recursive. 

As /?(x. y) = xy' is primitive recursive, 7xy 3 = xy" + ♦ • • + xy 3 is primitive 
recursive. Similarly, 4y" is primitive recursive. 
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SELF-TEST 

Choose the correct answer to Questions 1 - 10 . 

1. S(Z( 6 )) is equal to 

(a) Ufr 1. 2. 3) 

(b) Uj\l. 2. 3) 

(c) uia, 2 ,3) 

(d) none of these. 

2. Cons a(y) is equal to 

(a) a 

(b) ya 

(c) ay 

(d) a 

3. min(x, y) is equal to 

(a) x - (x -v) 

(b) y - (y — x) 

(c) x — y 

(d) y - x 

4. A(l, 2) is equal to 

(a) 3 

(b) 4 

(c) 5 

(d) 6 

5. /(x) = x/3 over N is 

(a) total 

(b) partial 

(c) not partial 

(d) total but not partial. 

6 . i/ a { 4 }( 3 ) is equal to 

(a) 0 

(b) 3 

(c) 4 

(d) none of these. 

7. sen(x) takes the value 1 if 

(a) x < 0 

(b) x < 0 

(c) x > 0 

(d) x > 0 

8. y/ A + y/ B = y/ AuB if 

(a ) Av B = A 

(b) A u B = B 

(c) A n B = A 

(d) A n B = 0 
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9. C/ 2 4 (5(4), 5(5), 5(6), Z(7)) is 

(a) 6 

(b) 5 

(c) 4 

(d) 0 

10. If g(x t y) = mm(x, y) and h(x, y) = \x - y|, then: 

(a) Both functions are regular functions. 

(b) The first function is regular and the second is not regular. 

(c) Neither of the functions is regular. 

(d) The second function is not regular. 

State whether the Statements 11-15 are true or false. 

11 . f(x, y) = x + y is primitive recursive. 

12. 3 - 4 = 0. 

13. The transpose function is not primitive recursive. 

14. The Ackermann’s function is recursive but not primitive recursive. 

15. A(2. 2) = 7. 


EXERCISES 


11.1 Test which of the following functions are total. If a function is not 
total, specify the arguments for which the function is defined. 

(a) /( x) = x/3 over N 

(b) fix) = l/(x - 1 ) over N 

(c) f(x) = x 2 - 4 over N 

(d) f(x) = x + 1 over N 

(e) fix) = x 2 over N 

11.2 Show that the following functions are primitive recursive: 


( a ) X{0)00 


|l ifAT = 0 
[0 if x ^ 0 


(b) f(pc) = 

(c) fix , y) 

(d) fix) = 


XT 

- maximum of x and y 

( x/2 when x is even 

Ux - l )/2 when x is odd 


(e) The sign function defined by 

sgn( 0 ) = 0 . sgn(v) = 1 if x > 0 . 
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(f) L(x. y) - 


if x > y 
if x < y 


11.3 

11.4 


f 1 if x = v 


(g) £(*, y) ~ 


if x ^ y 


Compute A( 3, 2). A(2, 3), A(3, 3). 


Show that the following functions are primitive recursive: 

(a) q(x, y) = the quotient obtained when .v is divided by y 

(b) r(x, y) - the remainder obtained when x is divided by y 


(c) fix) = 



if x is a perfect square 
otherwise 


11.5 Show that f(x) = integral part of ^ is partial recursive. 

11.6 Show that the Fibonacci numbers are generated by a primitive recursive 
function. 

11.7 Let /( 0) = 1, /( 1) = 2. /(2) = 3 and /(.x + 3) = fix) + fix + l) 2 + 
/O + 2)~\ Show that /(v) is primitive recursive. 

11.8 The characteristic function Xa of a given set A is defined as 


[0 

X.\ia) = \ 

t 


if a € A 
if a e A 


If A. 5 are subsets of N and Xa* Xb are recursive, show that Xa • 

Xa n b are also recursive. 

11.9 Show that the characteristic function of the set of all even numbers is 
recursive. Prove that the characteristic function of the set of all odd 
integers is recursive. 

11.10 Show that the function fix. y) = x - y is partial recursive. 

11.11 Show that a constant function over N\ i.e. fin) = k for all n in N where 
A: is a fixed number, is primitive recursive. 

11.12 Show that the characteristic function of a finite subset of N is primitive 
recursive. 

11.13 Show that the addition function /fix, y) is Turing-computable. 
(Represent v and y in tally notation and use concatenation.) 

11.14 Show that the Turing machine M in the Post notation (i.e, the transition 
function specified by quadruples) can be simulated by a Turing 
machine M (as defined in Chapter 9). 

[Hint: The transition given by a quadruple can be simulated by tw 7 o 
quintuples of M' by adding new states to Mi] 
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11.15 Compute Z(4) using the Turing machine constructed for computing the 
zero function. 

11.16 Compute 5(3) using the Turing machine which computes 5. 

11.17 Compute U{( 2, 1, 1), 2, 1), C//(L 2, 1) using the Turing 

machines which can compute the projection functions. 

11.18 Construct a Turing machine which can compute f(x) = x + 2. 

11.19 Construct a Turing machine which can compute f(x u x 2 ) = x { + 2 for 
the arguments 1, 2 (i.e. X[ = 1, x 2 = 2). 

11.20 Construct a Turing machine which can compute /(.vj. a 2 ) = A| + a 2 for 
the arguments 2, 3 (i.e. x x = 2, a 2 = 3). 



Complexity 


When a problem/language is decidable, it simply means that the problem is 
computationally solvable in principle. It may not be solvable in practice in the 
sense that it may require enormous amount of computation time and memory. 
In this chapter we discuss the computational complexity of a problem. The 
proofs of decidability/undecidability are quite rigorous, since they depend 
solely on the definition of a Turing machine and rigorous mathematical 
techniques. But the proof and the discussion in complexity theory rests on the 
assumption that P t- NP The computer scientists and mathematicians strongly 
believe that P ^ NP, but this is still open. 

This problem is one of the challenging problems of the 21st century. This 
problem carries a prize money of $1M. P stands for the class of problems that 
can be solved by a deterministic algorithm (i.e. by a Turing machine that 
halts) in polynomial time; NP stands for the class of problems that can be 
solved by a nondeterministic algorithm (that is, by a nondeterministic TM) in 
polynomial time; P stands for polynomial and NP for nondeterminisitc 
polynomial. Another important class is the class of AP-complete problems 
which is a subclass of NP. 

In this chapter these concepts are formalized and Cook's theorem on the 
NP-co mpleteness of SAT problem is proved. 


12.1 GROWTH RATE OF FUNCTIONS 

When we have two algorithms for the same problem, we may require a 
comparison between the running time of these two algorithms. With this in 
mind, we study the growth rate of functions defined on the set of natural 
numbers. 

In this section. N denotes the set of natural numbers. 
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Definition 12.1 Let f g : N — » R* (R + being the set of all positive real 
numbers). We say that f(n) = 0(g(n)) if there exist positive integers C and 
Nq such that 

fin) < Cg(n ) for all n > N 0 . 

In this case we say / is of the order of g (or / is 4 big oh' of g) 

Note: f{n) = Oigin )) is not an equation. It expresses a relation between two 
functions / and g. 


EXAMPLE 12.1 


Let f{n) = 4n 3 4 5/r 4 In 4 3. Prove that /(??) = £?(n J ). 

Solution 

In order to prove that/(n) = 0(/r), take C = 5 and /V 0 =10. Then 

f(n) - 4 ir 4 5/r 4 In + 3 < 5 n 3 for n > 10 

When n = 10. 5w 2 4 7/? 4 3 = 573 < 10 3 . For n > 10. 5n 2 4 In + 3 < ir\ 
Then, fin) = 0(if ). 

Theorem 12.1 If 77 ( 77 ) = a k n k 4 4 • • • 4 ap? 4 a 0 is a polynomial 

of degree A' over Z and a k > 0. then p(n) - 0(n k ). 

Proof pin) = a k n k 4 + * * * + api 4 a Q . As a k is an integer and 

positive. a k > 1. 

As ci k _ j. ci k _ 2 ? . . <7 0 and A are fixed integers, choose A 7 0 such that for 

all 7 ? > Av, each of the numbers 


I^A-l I \ a k~2 I 


n *" 1 ’ n k 


is less than 


(*) 


Hence. 


\ a k -\ , **-2 , , % 

|-4 —^-• ’ * -r —r 

77 ?r /r 


<1 


As a k > 1. —— = ^ 4 1- 4 * ■ • 4 -^1—v -^ 7 - > 0 for all n > A/q 

72 * 77 

Also, 


/i *" 1 n k 


pin) 


f «A-1 


T~ - u k 


a \ ^ «0 


l « ' n* -1 n A 




So. 


< a* 4 1 by (*) 
pin) < Cn k ’ where C = ^ : 4 1 


Hence. 


77(77) = 0 { if ). 


I 






348 y Theory of Computer Science 


Corollary The order of a polynomial is determined by its degree. 

Definition 12.2 An exponential function is a function q : N —> N defined by 
q(n) = a n for some fixed a > 1. 

When n increases, each of n, nr, 2 n increases. But a comparison of these 
functions for specific values of n will indicate the vast difference between the 
growth rate of these functions. 


TABLE 12.1 Growth Rate of Polynomial and Exponential Functions 


n 

f(n) = n 2 

g(n) = n 2 + 3n + 9 

q(n) = 2” 

1 

1 

13 

2 

5 

25 

49 

32 

10 

100 

139 

1024 

50 

2500 

2659 

(1.13)10 15 

100 

10000 

10309 

(1.27) 10 30 

1000 

1000000 

1003009 

(1.07)10 301 


From Table 12.1. it is easy to see that the function q(n) grows at a very fast 
rate when compared to f(n) or g(n\ In particular the exponential function 
grow s at a very fast rate when compared to any polynomial of large degree. 
We prove a precise statement comparing the growth rate of polynomials and 
exponential function. 

Definition 12.3 We say g ^ 0(f), if for any constant C and Aq, there exists 
n > A'o such that g(n) > Cf(n ). 

Definition 12>4 If / and g are two functions and / - 0(g), but g ^ 0(f), 
w ? e say that the growth rate of g is greater than that of f (In this case 
g(n)!f(n) becomes unbounded as n increases to *o.) 


Theorem 12.2 The growth rate of any exponential function is greater than 
that of any polynomial. 


Proof Let p(n) = a k n k + a k _ { n k 1 + . . . + a x n + a 0 and q(n) = a 11 for some 
a > 1 . 


As the growth rate of any polynomial is determined by its term with the 
highest power, it is enough to prove that n K = 0(a n ) and a 11 ^ 0(n k ). By 

L'Hospital's rule. l? iitends to 0 as n (Here log n = log^n.) If 

n ^ 



z(n) - 


then. 


As n gets large, k 


flog n'] 


k\ 

e 


log n \ 


Josh* 

- e - = e ^ - n 


v 


“—j tends to 0 and hence ~(n) tends to 0. 
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So we can choose N 0 such that z(n) < a for all n > Nq. Hence n k ~ 
z(n) n < cf\ proving n k = 0(d l ). 

To prove a n 7 * 0(n k ), it is enough to show that a u /n k is unbounded for 
large n. But we have proved that n k < d } for large n and any positive integer 

n n 

k and hence for k + 1. So n k+i < d' or —— > 1. 

n M 

CL 1 

> 72 , which means — is unbounded for large 
n 

values of n. I 

Note: The function w log ” lies between any polynomial function and a 11 for 
any constant a . As log n > k for a given constant k and large values of n , 
/ 2 log 11 > n k for large values of n. Hence n Iog ” dominates any polynomial. But 

77 logn = (e lo ^«) log ” = Let us calculate lim ^ —- . By L’Hospital’s 

CX 

rule, lim = lim(2log x )- = lim -* = lim — = 0 . 

A '—CX A'->=c C x —CX a —>?c CX 

So (log 7z)" grows more slowly than cn. Hence n Iog/? = e (iog//) “ grows more 
slowly than 2 cn . The same holds good when logarithm is taken over base 2 
since log t >n and log^? differ by a constant factor. 

Hence there exist functions lying between polynomials and exponential 
functions. 

12.2 THE CLASSES P AND NP 

In this section we introduce the classes P and NP of languages. 

Definition 12*5 A Turing machine M is said to be of time complexity T(n) 
if the following holds: Given an input w of length n y M halts after making at 
most T(n) moves. 

Note: In this case. M eventually halts. Recall that the standard TM is called 
a deterministic TM. 

Definition 12.6 A language L is in class P if there exists some polynomial 
Tin) such that L - T(M) for some deterministic TM M of time complexity 
T{n). 


Multiplying by n , n 


k +1 


EXAMPLE 12.2 


Construct the time complexity T(n) for the Turing machine M given in 
Example 9.7. 
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Solution 

In Example 9.7. the step (i) consists of going through the input string (0 ? T r ) 
forward and backward and replacing the leftmost 0 by x and the leftmost 1 
by y. So we require at most 2 n moves to match a 0 with a I. Step (ii) is 
repetition of step (i) n times. Hence the number of moves for accepting a n h n 
is at most (2 n)(n). For strings not of the form cfb v \ TM halts with less than 
2 n 2 steps. Hence T(M) = 0(ir), 

We can also define the complexity of algorithms. In the case of 
algorithms. T(n) denotes the running time for solving a problem with an input 
of size n , using this algorithm. 

In Example 12.2. we use the notation which is used in expressing 
algorithm. For example, a <r- b means replacing a by b. 

[a] denotes the smallest integer greater than or equal to a. This is called 
the ceiling function. 


EXAMPLE 12.3 


Find the running time for the Euclidean algorithm for evaluating gcd(m b) 
where a and b are positive integers expressed in binary representation. 

Solution 

The Euclidean algorithm has the following steps: 

1. The input is (a, b) 

2. Repeat until b = 0 

3. Assign a <— a mod b 

4. Exchange a and b 

5. Output a. 

Step 3 replaces a by a mod b. If all > b, then a mod b < b < all. If 
all < b, then a < lb . Write a = b + r for some r < b. Then a mod b = 
r < b < all. Hence a mod b < all . So a is reduced by at least half in size on 
the application of step 3. Hence one iteration of step 3 and step 4 reduces a 
and b by at least half in size. So the maximum number of times the steps 3 
and 4 are executed is min{riog 2 <H flog 2 hl}. If n denotes the maximum of the 
number of digits of a and h . that is max{[log 2 fll. riog 2 & 1 } then the number of 
iterations of steps 3 and 4 is O(n). We have to perform step 2 at most 
min{iiog2^1. times or n times. Hence Tin) = nO(n) - 0(n~). 

Note: The Euclidean algorithm is a polynomial algorithm. 

Definition 12.7 A language L is in class NP if there is a nondeterministic 
TM M and a polynomial time complexity T(n) such that L = T(M) and M 
executes at most T(n) moves for every input w of length n. 
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We have seen that a deterministic TM simulating a nondeterministic 
TM M exists (refer to Theorem 9.3). If T(n) is the complexity of M y then the 
complexity of the equivalent deterministic TM M l is l 0iT[u) \ This can be 
justified as follows. The processing of an input string w of length n by M is 
equivalent to a 'tree' of computations by M x . Let k be the maximum of the 
number of choices forced by the nondeterministic transition function. (It is 
max|5(<7, a*))|, the maximum taken over all states q and all tape symbol X.) 
Every branch of the computation tree has a length T(n) or less. Hence the total 
number of leaves is almost kT(n). Hence the complexity of M x is at most 

'yO(Tinf) 

It is not known whether the complexity of M x is less than 2 0ll ^ }1)) . Once 
again an answer to this question will prove or disprove P ^ NP. But there do 
exist algorithms where T(n) lies between a polynomial and an exponential 
function (refer to Section 12.1). 


123 POLYNOMIAL TIME REDUCTION AND 
/VP-COMPLETE NESS 

If P 2 and P 2 are two problems and P 2 e P, then we can decide whether 
Pi e P by relating the two problems P x and P 2 . If there is an algorithm for 
obtaining an instance of P 2 given any instance of P b then we can decide about 
the problem P r Intuitively if this algorithm is a polynomial one, then the 
problem P x can be decided in polynomial time. 

Definition 12.8 Let P 1 and P 2 be two problems. A reduction from P\ to P 2 
is an algorithm which converts an instance of P\ to an instance of P 2 . If the 
time taken by the algorithm is a polynomial p(n ), n being the length of the 
input of P x . then the reduction is called a polynomial reduction P\ to P 2 . 

Theorem 12.3 If there is a polynomial time reduction from P x to P 2 and if 
P 2 is in P then P x is in P. 

Proof Let m denote the size of the input of Pj. As there is a polynomial¬ 
time reduction of Pi to P 2 . the corresponding instance of P 2 can be got in 
polynomial-time. Let it be O(m), So the size of the resulting input of P 2 is 
atmost cm J for some constant c. As P 2 is in P, the time taken for deciding the 
membership in P 2 is 0(n k ). n being the size of the input of P 2 . So the total 
time taken for deciding the membership of m-size input of Pj is the sum of 
the time taken for conversion into an instance of P 2 and the time for decision 
of the corresponding input in P 2 . This is 0[m j + (cm J ) k ], which is the same 
as 0{rrf k ), So P { is in P. I 

Definition 12.9 Let L be a language or problem in NP. Then L is NP- 
complete if 

1. L is in NP 
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2. For every language L f in NP there exists a polynomial-time reduction 
of V to L 

Note: . The class of A 7 P-complete languages is a subclass of NP. 

The next theorem can be used to enlarge the class of NP-complete 
problems provided we have some known AP-complete problems. 

Theorem 12.4 If is AP-complete, and there is a polynomial-time 
reduction of P x to P 2 , then P 2 is AP-complete. 

Proof If L is any language in NP, we show that there is a polynomial-time 
reduction of L to P 2 . As P\ is AP-complete, there is a polynomial-time 
reduction of L to P { . So the time taken for converting an n-size input string 
w in L to a string x in P { is at most p\(n) for some polynomial py As there 
is a polynomial-time reduction of P x to P 2 , there exists a polynomial p 2 such 
that the input x to Pj is transferred into input y to P 2 in at most p 2 (n) time. 
So the time taken for transforming w to y is at most pfn) + p 2 (p\{n)). As 
Pi(n) + p 2 (p\(n)) is a polynomial, we get a polynomial-time reduction of 
L to P y Hence P 2 is A r P-complete. I 

Theorem 12.5 If some AT’-complete problem is in P, then P = NP. 

Proof Let P be an AP-complete problem and P e P. Let L be any 
AP-complete problem. By definition, there is a polynomial-time reduction of 
L to P. As P is in P, L is also in P by Theorem 12.3. Hence NP = P. 


12.4 IMPORTANCE OF /VP-COMPLETE PROBLEMS 

In Section 12.3, we proved theorems regarding the properties of NP-complete 
problems. At the beginning of this chapter we noted that the computer 
scientists and mathematicians strongly believe that P ^ NP. At the same time, 
no problem in NP is proved to be in P. The entire complexity theory rests 
on the strong belief that P ^ NP. 

Theorem 12.4 enables us to extend the class of AP-complete problems, 
while Theorem 12.5 asserts that the existence of one AP-complete problem 
admitting a polynomial-time algorithm will prove P = NP. More than 2500 
AP-complete problems in various fields have been found so far. 

We will prove the existence of an AP-complete problem in Section 12.5. 
We will give a list of AP-complete problems in Section 12.6. Thousands of 
A 7 P-complete problems in various branches such as Operations Research, 
Logic, Graph Theory, Combinatorics, etc. have been constructed so far. A 
polynomial-time algorithm for any one of there problems will yield a proof 
of P = NP. But such multitude of A 7 P-complete problems only strengthens the 
belief of the computer scientists that P ^ NP. We will discuss more about this 
in Section 12.7. 
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12.5 SAT IS NP-C OMPLETE 

In this section, we prove that the satisfiability problem for boolean expressions 
(whether a boolean expression is satisfiable) is AT-complete. This is the first 
problem to be proved A’P-complete. Cook proved this theorem in 1971. 

12.5.1 Boolean Expressions 

In Section 1.1.2, we defined a well-formed formula involving propositional 
variables. A boolean expression is a well-formed formula involving boolean 
variables x , y, z replacing propositions P, Q y R and connectives v, a and —■. 
The tmth value of a boolean expression in x, y, z is determined from the truth 
values of x, y, z and the truth tables for v, a and -n. For example, -r x a -i 
( y v z) is a boolean expression. The expression —i x a —i (y v z) is true when 
a* is false, y is false and c is false. 

Definition 12.10 (a) A tmth assignment t for a boolean expression E is the 

assignment of tmth values T or F to each of the variables in E . For example, 
t - (F, F, F) is a tmth assignment for (x\ y, z) where x, y, z are the variables 
in a boolean expression E(x } y, z) = —■ x a —i (y v z)- 

The value E(t) of the boolean expression E given a truth assignment t is 
the tmth value of the expression of E, if the truth values give by t are assigned 
to the respective variables. 

If t ~ (F, F, F) then the tmth values of —i a and —i (y v z) are T and T. 
Hence the value of E - — , a* a —i (y v z) is T. So E(t) = T. 

Definition 12.11 A truth assignment t satisfies a boolean expression E if the 
tmth value of E(t) is T. In other words, the tmth assignment t makes the 
expression E tme. 

Definition 12.12 A boolean expression E is satisfiable if there exists at least 
one tmth assignment t that satisfies E (that is E(t) = 7). For example, E = 
—i A' a —i (v v -) is satisfiable since E(t) = T when t = {F, F f F). 


12.5.2 Coding a Boolean Expression 

The symbols in a boolean expression are the variables a, y, z> etc. the 
connectives v. a, —i. and parantheses ( and ). Thus a boolean expression in 
three variables wall have eight distinct symbols. The variables are written as 
Aj. Ai, a 3 , etc. Also we use x„ only after using a i? a 2 , . . ., a ;i _i for variables. 
We encode a boolean expression as follow's: 

1. The variables x u a 2 , a 2 , . . . are w^ritten as xl, aIO, a11, . . . etc. (The 
binary^ representation of the subscript is written after a.) 

2. The connectives v. a, —i, (, and ) are retained in the encoded 
expression. 
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For example, -i x a -i (y v z) is encoded as -i aIa -i (xlO v All), (where 
x, y, z are represented by x u x 2 , x 3 ). 

Note: Any boolean expression is encoded as a string over X = {a; 0, 1. v, 

A, C (, )} 

Consider a boolean expression having m occurrences of variables, 
connectives and parantheses. The variable x m can be represented using 
1 + log 2 m symbols (x together with the digits in the binary representation of 
//?). The other occurrences require less symbols. So any occurrence of a 
variable, connective or a parenthesis requires at most 1 + log 2 m symbols over 
X. So the length of the encoded expression is at most 0(m log m). 

As our interest is only in deciding whether a problem can be solved in 
polynomial-time, we need not distinguish between the length of the coded 
expression and the number of occurrences of variables etc. in a boolean 
expression. 


12.5.3 Cook’s Theorem 

In this section we define the SAT problem and prove the Cook's theorem that 
SAT is NP-co mplete. 

Definition 12.13 The satisfiability problem (SAT) is the problem: 

Given a boolean expression, is it satisfiable? 

Note: The SAT problem can also be formulated as a language. We can 
define SAT as the set of all coded boolean expressions that are satisfiable. So 
the problem is to decide whether a given coded boolean expression is in SAT. 

Theorem 12.6 (Cook's theorem) SAT is AR-complete. 

Proof PART I: SAT e NP. 

If the encoded expression E is of length n, then the number of variables is 
Tn/21. Hence, for guessing a truth assignment t we can use multitape TM for 
E. The time taken by a multitape NTM M is 0(n). Then M evaluates the 
value of E for a truth assignment t. This is done in 0(rr) time. An equivalent 
single-tape TM takes 0(n 4 ) time. Once an accepting truth assignment is found, 
M accepts E and M and halts. Thus we have found a polynomial time 
NTM for SAT. Hence SAT e NP. 

PART II: POLYNOMIAL-TIME REDUCTION OF ANY L IN NP TO SAT. 
1. Construction of NTM for L 

Let L be any language in NP. Then there exists a single-tape NTM M and a 
polynomial p(n) such that the time taken by M for an input of length n is at 
most p(n) along any branch. We can further assume that this M never writes 
a blank on any move and never moves its head to the left of its initial tape 
position (refer to Example 12.6). 
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If M accepts an input w and | >v | - n , then there exists a sequence of moves 
of M such that 

1. Oq is the initial ID of M with input iv. 

2. Or, j— a x |— • • • [— 0Cb k - pO 0. 

3. a k is an ID with an accepting state. 

4. Each (X: is a string of nonblanks, its leftmost symbol being the 
leftmost symbol of w (the only exception occurs when the processing 
of w is complete, in which case the ID is qb). 

2. Representation of Sequence of Moves of M 

As the maximum number of steps on w is p(n) we need not bother about the 
contents beyond p(n ) cells. We can write a 7 as a sequence of p(n) + 1 symbols 
(one symbol for the state and the remaining symbols for the tape symbols). 

SO Cl} — Xfc)Xjl . . . Xj, p(ny 

By assuming Q n Y = 0, we can locate the state in and hence the 
position of the tape head. The length of some ID may be less than p(n). In 
this case we pad the 3D on the right with blank symbols, so that all IDs are 
of the same length p(n) + 1. Also the acceptance may happen earlier. If a m 
is an accepting ID in the course of processing w, then we write Oq |— ... |— 
a m j— <*///••• |— a m - a i.p(nr 

Thus all IDs have p(n) + 1 symbols and any computation has p(n) moves. 


TABLE 12.2 Array of IDs 


ID 

0 1 

j - 1 

j j +1 

p(n) 

a Q 

A'oo Xoi 



*0 r p{n) 

a i 

*10 *11 



*1 .pin) 

a. 

Ao */i 

x LH 

Xjj X,-j +1 

■ ■ Xjpfr) 

a;+i 

V+1,0 X /+1i i 

Xm.h 

*/+ 1 j */+i ,;+i 

■ ■ */'+1 t p(n) 

Ctpi.n) 




*p(n).p(/i) 

So 

we can represent any 

computation 

as an (p(n) +1) x (p(n) + 1) array 


as in Table 12.2. 

3, Representation of IDs in Terms of Boolean Variables 

We define a boolean variable \y } - A corresponding to (/. j)th entry' in the zth ID. 
The variable \;j A represents the proposition that Xjj = A . where A is a state or 
tape symbol and 0 < i, j < p(n). 

We simulate the sequence of IDs leading to the acceptance of an input 
string iv by a boolean expression. This is done in such a way that M accepts 
ir if an only if the simulated boolean expression E Mav is satisfiable. 
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4, Polynomial Reduction of M to SAT 

In order to check that the reduction of M to SAT is correct, we have to ensure 
the correctness of 

(a) the initial ID, 

(b) the accepting ID. and 

(c) the intermediate moves between successive IDs. 

(a) Simulation of initial ID 

X 00 must start with the initial state q 0 °f M followed by the symbols of 
w = a\ci 2 ... a n of length n and ending with b's (blank symbol). The 
corresponding boolean expression S is defined as 

S = y'OGtjQ A To 1 ! A }'01«2 A ... A A Y()ji+l*b A . . . A Vq ip(n),b 

Thus given an encoding of M and w, we can write S in a tape of a multiple 
TM My. This takes 0(p(n)) time. 

(b) Simulation of accepting ID 

a pUn is the accepting ID. If p x , Pk are t^ e accepting states of M, then 

a p[n , contains one of p- s. 1 < i < k in any place j. If a pin) contains an accepting 
state pi in jth position, then x p(flu j is the accepting state p The corresponding 
boolean expression covering all the cases (0 < j < p(n), I < i < k) is given 
by 

F - F 0 v F { v . . . v F p(n) 

where 

k'j — yp(n).j.p { ^ yp(n)j f p2 ^ ^ ypinij. p^ 

Each Fj has k variables and hence has constant number of symbols 
depending on M but not on n. The number of Ff s in F is p(n). Thus given 
an encoding of M and \\\ F can be written in 0(p(n)) time on the multiple 
TM M x . 

(c) Simulation of intermediate moves 

We have to simulate valid moves |— a i+ i, i = 0, 1, 2, ... p(n). 

Corresponding to each move, we have to define a boolean variable Np Hence 
the entire sequence of IDs leading to acceptance of w is 

N = N 0 a Aq a ... a Npoo-i 

First of all note that the symbol X i+ \j can be determined from X ip 

Xj j +i by the move (if there is one changing a t to a different a /+1 ). For every 
position (/, j). we have two cases: 

Case 1 The state of is at position j. 

Case 2 The state of is not in any of the (j - l)th, jth and (j + l)th 
positions. 
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Case 1 is taken care of by a variable Ay and Case 2 by a variable B l y. 
The variable yV f - will be designed in such a way that it gurantees that ID 
a i+] is one of the IDs that follows the ID a r 
X {+i j can be determined from 

(i) the three symbols X L j_ h X l]y X hjJrl above it 

(ii) the move chosen by the nondeterministic TM M when one of the three 
symbols (in (i)) is a state. 

If the state of a,- is not Xy, X/_j or X L ,- +1 . then X i+ \j = Xy. This is taken 
care of by the variable By. 

If Xtf is the state of a h then X L /+1 is being scanned by the state Xy. The 
move corresponding to the state-tape symbol pair (Xy, Xj m j+\) will determine 
the sequence X /+1>/ -_ { X iM jX i+ ij + \. This is taken care of by the variable Ay. 
We write Nj = a 7 - (Ay v By), where a is taken over all f s. 0 < j < p(n ). 

(i) Formulation of Bjj When the state of a* is none of Xy, 

then the transition corresponding to |— a j+] will not affect X / ;+1 . In this 

case Xj+i'; ~ Xjj 

Denote the tape symbols by Zj, Z 2 , . . Z r Then Xjj_\, Xyj and Xjj + \ 
are the only tape symbols. So we write By as 

By v v v y'uj-i.z) A 

(>'/,/z 1 v )'i,j.z 2 v yij.z r ) A 

(>’/. ,/+l,Z 1? v Vf.j+I.Z, v >’/../+i.z,) A 

D/,;.z 1 A Xvl/.Zj) v Ok./.z, A . v m.;,z : v ... v (>y/,z r A T/+ij,z r ) 

This first line of By says that X L / -_ 1 is one of the tape symbols 
Zi. Z 2 . . . Z r The second and third lines are regarding X Lj and X L ^. The 
fourth line says that Xy and X/^ +1 are the same and the common value is any 
one of Zj. Z 2 . . . Z r 

Recall that the head of M never moves to the left of 0-cell and does not 
have to move to the right of the p(?i)-cell. So B$ will not have the first line 
and B Lp(m will not have the third line. 

(ii) Formulation of Ay This step corresponds to the correctness of 
the 2x3 array (see Table 12.3). 


TABLE 12.3 Valid Computation 


X;J- 1 

! -v, 

j 

[— 

1 

x u* 

Z'+i ,j-‘\ 

| X M.j 

| 

Xj+ 1. /+ i j 
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The expression B t j takes care of the case when the state of a /: is not at the 
position X L j_i, X Lj or X L y +1 , The Ajj coiTesponds to the case when the state 
of is at the position X V] . In this case we have to assign boolean variables 
to six positions given in Table 12.3 so that the transition corresponding to 
(X; |— a i+ i is described by the variables in the box correctly. 

We say that that an assignment of symbols to the six variables in the box 
is valid if 

1. Xjj is a state but X it j_ { and Xjj+i are tape symbols. 

2. Exactly one of X i+uj _ u X i+L p X i+l j+] is a state. 

3. There is a move which explains how (X L X }p X L ; - +1 ) changes to 
(Xi+ij-i* Xi+\j’Xj+ij+i) in a } f a i+i . 

There are only a finite number of valid assignments and A# is obtained 
by applying OR (that is v) to these valid assignments. A valid assignment 
corresponds to one of the following four cases: 

Case A (p, C, L) e S(q, A) 

Case B (p, C, R ) e S(q, A) 

Case C a t - (when a. and a ;+l contain an accepting state) 

Case D 7 = 0 and j = 77 ( 77 ) 

Case A Let D be some tape symbol of M, Then X } } _\XjjX L j +] = DqA and 
X i+L j_\X i+ i j X i+L j + i = pDC . This can be expressed by the boolean variable. 

)T/-b D A yij.tf A Ti.y+l,,! A 3’/+K/-l./? A > ? /+L j. D A ) ? h-L/+1,C 

Case B As in case A, let D be any tape symbol. In this case X L j-\XjjXjq + y 
= DqA and X,- \,j-iX i+ \= DCp. The corresponding boolean 
expression is 

)?./-]. D A 3 i. 7 . q A 3 /. j-r 1. A A y i-r l. j- 1. D A 3V+ ), /, C A 3 /+1, j+ 1, p 

Case C In this case X i j_[X i jX i j + 1 = X; +1 jX /+L/ - +1 . 

In this case the same tape symbol say D appears in X - h7 _| and X i+l j_ } \ some 
other tape symbol say D' in X u ; +1 and X ?+L/+1 . X Li and contain the 

same state. One typical boolean expression is 

}/. A 3/. /. <y A 3/,.7-t-K Z; A 3H-l./-), Z/ : A 37+1./. q A 3 /+1. /+!. Z^ 

Case D When j - 0. we have only X /0 Xu and X /+1 0 X i+L j. This is a special 
case of Case B. j = /?(7z) corresponds to a special case of Case A. 

So. Ajj is defined as the OR of all valid terms obtained in Case A to 
Case D . 

(iii) Definition of N t and N We define N- t and N by 

^i- (A-o v B l0 ) a (A n v Bn) a ... a (Aj p (u) v B i>pin) ) 

N — Aq a A j a N 2 a ... a A 
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(iv) Time taken for writing N The time taken to write B if is a constant 
depending on the number \T\ of tape symbols. (Actually the number of 
variables in B p is 5|T|). The time taken to write depends only on the 
number of moves of M. As A? is obtained by applying OR to A {j a 
0 < / < pin) - 1 . 0 < j < p{n) - 1 , the time taken to write on Nj is Oip(n)). 
As N is obtained by applying a to N {h A/j, . , .. N p{n] _i. the time taken to write 
N is p(n)0(p(ri)) = Oip\n)). 

5, Completion of Proof 

Let E h , u Vt . = S a N a F. 

We have seen that the time taken to write S and F are 0(p(n)) and the 
time taken for N is 0(p 2 (n )). Hence the time taken to write E M w is 0(p 2 (n)). 

Also M accepts w if and only if E M w is satisfiable. 

Hence the deterministic multitape TM M x can convert w to a boolean 
expression E M w in 0(p 2 (n)) time. An equivalent single tape TM takes 
0(p 4 (n)) time. This proves the Part II of the Cook's theorem, thus completing 
the proof of this theorem. I 

12.6 OTHER AfP-COMPLETE PROBLEMS 

In the last section, we proved the AT-completeness of SAT. Actually it is 
difficult to prove the AT-completeness of any problem. But after getting one 
A T P-complete problem such as SAT. we can prove the AP-completeness of 
problem P 7 by obtaining a polynomial reduction of SAT to P\ The 
polynomial reduction of SAT to P' is relatively easy. In this section we give 
a list of A r P-complete problems without proving their AT-completeness. Many 
of the AT-complete problems are of practical interest. 

1. CSAT—Given a boolean expression in CNF (conjunctive normal 
form—Definition 1.10). is it satisfiable? 

We can prove that CSAT is AT-complete by proving that CSAT is 
in NP and getting a polynomial reduction from SAT to CSAT. 

2. Hamiltonian circuit problem—Does G have a Hamiltonian circuit (i.e. 
a circuit passing through each edge of G exactly once)? 

3. Travelling salesman problem (TSP)—Given n cities, the distance 
between them and a number D, does there exist a tour programme for 
a salesman to visit all the cities exactly once so that the distance 
travelled is at most D1 

4. Vertex cover problem—Given a graph G and a natural number k f does 
there exist a vertex cover for G with k vertices? (A subsets C of 
vertices of G is a vertex cover for G if each edge of G has an odd 
vertex in C.) 
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5. Knapsack problem—Given a set A - {a x , a 2 , . . a n ) of nonnegative 
integers, and an integer K } does there exist a subset B of A such that 

A R b l = 

b ; eB 

This list of AE-complete problems can be expanded by having a 
polynomial reduction of known iVP-complete problems to the problems which 
are in NP and which axe suspected to be A r P-complete. 

12.7 USE OF NP-COMPLETENESS 

One practical use in discovering that problem is AP-complete is that it 
prevents us from wasting our time and energy over finding polynomial or easy 
algorithms for that problem. 

Also we may not need the full generality of an NP-c omplete problem. 
Particular cases may be useful and they may admit polynomial algorithms. 
Also there may exist polynomial algorithms for getting an approximate 
optimal solution to a given NP -complete problem. 

For example, the travelling salesman problem satisfying the triangular 
inequality for distances between cities (i.e. d\j < d ik + d k j for all /, j, k) has 
approximate polynomial algorithm such that the ratio of the error to the 
optimal values of total distance travelled is iess than or equal to 1/2. 


12.8 QUANTUM COMPUTATION 

In the earlier sections we discussed the complexity of algorithm and the dead 
end was the open problem P = NP. Also the class of AP-complete problems 
provided us with a class of problems. If we get a polynomial algorithm for 
solving one NP-c omplete problem we can get a polynomial algorithm for any 
other /VT-complete problem. 

In 1982. Richard Feynmann, a Nobel laurate in physics suggested that 
scientists should start thinking of building computers based on the principles 
of quantum mechanics. The subject of physics studies elementary objects and 
simple systems and the study becomes more intersting when things are larger 
and more complicated. Quantum computation and information based on the 
principles of Quantum Mechanics will provide tools to fill up the gulf between 
the small and the relatively complex systems in physics. In this section we 
provide a brief survey of quantum computation and information and its impact 
on complexity theory. 

Quantum mechanics arose in the early 1920s, when classical physics could 
not explain everything even after adding ad hoc hypotheses. The rules of 
quantum mechanics were simple but looked counterintuitive, and even Albert 
Einstein reconciled himself with quantum mechanics only with a pinch of salt. 

Quantum Mechanics is real black magic calculus. 

—A. Einstein 
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12.8.1 Quantum Computers 

We know that a bit (a 0 or a 1) is the fundamental concept of classical 
computation and information. Also a classical computer is built from an 
electronic circuit containing wires and logical gates. Let us study quantum bits 
and quantum circuits which are analogous to bits and (classical) circuits. 

A quantum bit, or simply qubit can be described mathematically as 

W) = a|0> + j3|0> 

The qubit can be explained as follows. A classical bit has two states, a 0 and 
a 1. Two possible states for a qubit are the states |0) and |1). (The notation 
|.) is due to Dirac.) Unlike a classical bit, a qubit can be in infinite number 
of states other than |0) and |1). It can be in a state \y/) = a|0) + /3|0), where 
a and /3 are complex numbers such that \a\ 2 + \fi\ 2 = 1. The 0 and 1 are called 
the computational basis states and |i//) is called a superposition. We can call 
| \ff) = a|0) + ft 10) a quantum state. 

In the classical case, we can observe it as a 0 or a 1. But it is not possible 
to determine the quantum state on observation. When we measure/observe a 
qubit, we get either the state jO) with probability \a \ 2 or the state jl) with 
probability |/3p. 

This is difficult to visualize, using our ‘classical thinking' but this is the 
source of power of the quantum computation. 

Multiple qubits can be defined in a similar way. For example, a two-qubit 
system has four computational basis states, (00), (01), 110) and 111) and 
quantum states \y/) ~ o^OO) + #bi|01) + c%|10) + a^ll) with |o^ 0 | 2 + (c^ip 

+ |a 10 p + |a u p = 1. 

Now we define the qubit gates. The classical NOT gate interchanges 0 
and 1. In the case of the qubit the NOT gate, «|0) + /311), is changed to 
«|1) + /3|0). 

The action of the qubit NOT gate is linear on two-dimensional complex 
vector spaces. So the qubit NOT gate can be described by 


a 

A 

"o 

r 

a 


A 

A 


1 

0_ 

A 


a 


The matrix 


0 

1 


1 

0 


is a unitary matrix. (A matrix A is unitary if A adj A = /.) 


We have seen earlier that {NOR} is functionally complete (refer to 
Exercises of Chapter 1). The qubit gate corresponding to NOR is the 
controlled-NOT or CNOT gate. It can be described by 

|A, B) -> | A. B ® A) 
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where © denotes addition modulo 2. The action on computational basis is 
|00) —» 100). |01) —> |01), 110) —> |11). |11) —> 110). It can be described by 
the following 4x4 unitary matrix: 

"1 0 0 0 “ 

0 10 0 
0 0 0 1 
0 0 1 0 _ 

Now. we are in a position to define a quantum computer: 

A quantum computer is a system built from quantum circuits , containing 
wires and elementally quantum gates , to cany out manipulation of quantum 
infonnation. 


12.8.2 Church-Turing Thesis 

Since 1970s many techniques for controlling the single quantum systems have 
been developed but with only modest success. But an experimental prototype 
for performing quantum cryptography, even at the initial level may be useful 
for some real-world applications. 

Recall the Church-Turing thesis which asserts that any algorithm that can 
be performed on any computing machine can be performed on a Turing 
machine as well. 

Miniaturization of chips has increased the power of the computer. The 
growth of computer power is now described by Moore's law. which states that 
the computer power will double for constant cost once in every two years. 
Now it is felt that a limit to this doubling power will be reached in two or 
three decades, since the quantum effects will begin to interfere in the 
functioning of electronic devices as they are made smaller and smaller. So 
efforts are on to provide a theory of quantum computation which will 
compensate for the possible failure of the Moore’s law. 

As an algorithm requiring polynomial time was considered as an efficient 
algorithm, a strengthened version of the Church-Turing thesis was enunciated. 

Any algorithmic process can be simulated efficiently by a Turing machine . 
But a challenge to the strong Church-Turing thesis arose from analog 
computation. Certain types of analog computers solved some problems 
efficiently whereas these problems had no efficient solution on a Turing 
machine. But when the presence of noise was taken into account, the power 
of the analog computers disappeared. 

In mid-1970s. Robert Solovay and Volker Strassen gave a randomized 
algorithm for testing the primality of a number. (A deterministic polynomial 
algorithm was given by Manindra AgrawaL Neeraj Kayal and Nitein Saxena 
of IIT Kanpur in 2003.) This led to the modification of the Church thesis. 
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Strong Church-Turing Thesis 

Any algorithmic process can be simulated efficiently using a nondeterministic 
Turing machine . 

In 1985, David Deutsch tried to build computing devices using quantum 
mechanics. 

Computers are physical objects, and computations are physical processes. 
What computers can or cannot compute is determined by the law of 
physics alone , and not by pure mathematics 

—David Deutsch 

But it is not known whether Deutsch's notion of universal quantum 
computer will efficiently simulate any physical process. In 1994, Peter Shor 
proved that finding the prime factors of a composite number and the discrete 
logarithm problem (i.e. finding the positive value of s such that b - a s for the 
given positive integers a and b) could be solved efficiently by a quantum 
computer. This may be a pointer to proving that quantum computers are more 
efficient than Turing machines (and classical computers). 

12.8.3 Power of Quantum Computation 

In classical complexity theory, the classes P and NP play a major role, but 
there are other classes of interest. Some of them are given below: 

L—The class of all decision problems which may be decided by a TM 
running in logarithmic space. 

PSP ACE—The class of decision problems which may be decided on a Turing 
machine using a polynomial number of working bits, with no limitation on the 
amount of time that may be used by the machine. 

EXP—The class of all decision problems which may be decided by a TM in 
exponential time, that is, 0(2 uk ), k being a constant. 

The hierarchy of these classes is given by 

LcPcNPc PSPACE c EXP 

The inclusions are strongly believed to be strict but none of them has been 
proved so far in classical complexity theory. 

We also have two more classes. 

BPP—The class of problems that can be solved using the randomized 
algorithm in polynomial time, if a bounded probability of error (say 1/10) is 
allowed in the solution of the problem. 

Bop—The class of all computational problems which can be solved 
efficiently (in polynomial time) on a quantum computer where a bounded 
probability of error is allowed. It is easy to see that BPP c BQP. The class 
BQP lies somewhere between P and PSPACE, but where exactly it lies with 
respect to P. NP and PSPACE is not known. 
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It is easy to give non-constructive proofs that many problems are in EXP, 
but it seems very hard to prove that a particular class of problems is in EXP 
(the possibility of a polynomial algorithm of these problems cannot be ruled 
out). 

As far as quantum computation is concerned, two important classes are 
considered. One is BQP. which is analogous to BPP The other is NPI 
(NP intermediate) defined by 

NPI — The class of problems which are neither in P nor AfP-complete 

Once again, no problem is shown to be in NPI. In that case P ■£ NP is 
established. 

Two problems are likely to be in NPI, one being the factoring problem 
(i.e. given a composite number n to find its prime factors) and the other being 
the graph isomorphism problems (i.e. to find whether the given undirected 
graphs with the same set of vertices are isomorphic). 

A quantum algorithm for factoring has been discovered. Peter Shor 
announced a quantum order-finding algorithm and proved that factoring could 
be reduced to order-finding. This has motivated a search for a fast quantum 
algorithm for other problems suspected to be in NPI. 

Grover developed an algorithm called the quantum search algorithm. A 
loose formulation of this means that a quantum computer can search a 
particular item in a list of N items in 0(s[n ) time and no further improvement 
is possible. If it were <9(log N ). then a quantum computer can solve an NP - 
complete problem in an efficient way. Based on this, some researchers feel that 
the class BQP cannot contain the class of AP-complete problems. 

If it is possible to find some structure in the class of AP-complete 
problems then a more efficient algorithm may become possible. This may 
result in finding efficient algorithms for A 7 P-complete problems. If it is 
possible to prove that quantum computers are strictly more powerful than 
classical computers, then it will follow that P is properly contained in 
PSPACE. Once again, there is no proof so far for P c PSPACE. 


12.8.4 Conclusion 

Deutsch proposed the first blueprint of a quantum computer. As a single qubit 
can store two states 0 and 1 in quantum superposition, adding more qubits to 
the memory register will increase the storage capacity exponentially. When 
this happens, exponential complexity will reduce to polynomial complexity. 
Peter Shoris algorithm led to the hope that quantum computer may work 
efficiently on problems of exponential complexity. 

But problems arise at the implementation stage. When more interacting 
qubits are involved in a circuit, the surrounding environment is affected by 
those interactions. It is difficult to prevent them. Also quantum computation 
will spread outside the computational unit and will irreversibly dissipate useful 
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information to the environment. This process is called decoherence . The 
problem is to make qubits interact with themselves but not with the 
environment. Some physicists are pessimistic and conclude that the efforts 
cannot go beyond a few simple experiments involving only a few qubits. 

But some researchers are optimistic and believe that efforts to control 
decoherence will bear fruit in a few years rather than decades. 

It remains a fact that optimism, however overstretched, makes things 
happen. The proof of Fermat's last theorem and the four colour problem are 
examples of these. Thomas Watson, the Chairman of IBM. predicted in 1943, 
"I think there is a world market for maybe five computers”. But the growth 
of computers has very much surpassed his prediction. 

Charles Babbage (1791-1871) conceived of most of the essential elements 
of a modem computer in his analytical engine. But there was not sufficient 
technology available to implement his ideas. In 1930s, Alan Turing and 
John von Neumann thought of a theoretical model. These developments in 
‘Software* were matched by ‘Hardware' support, resulting in the first 
computer in the early 1950s. Then, the microprocessors in 1970s led to the 
design of smaller computers with more capacity and memory. 

But computer scientists realized that hardware development will improve 
the power of a computer only by a multiplicative constant factor. The study 
of P and NP led to developing approximate polynomial algorithms to 
AP-complete problems. Once again the importance of software arose. Now the 
quantum computers may provide the impetus to the development of computers 
from the hardware side. 

The problem of developing quantum computers seems to be very hard but 
the history of sciences indicates that quantum computers may rule the universe 
in a few decades. 

12.9 SUPPLEMENTARY EXAMPLES 


EXAMPLE 12.4 


Suppose that there is an /VP-complete problem P that has a deterministic 
solution taking O(n logn ) time (here log n denotes log^n). What can you say 
about the running time of any other A'P-complete problem Q'l 

Solution 

As Q e NP. there exists a polynomial p(n) such that the time for reduction 
of Q to P is atmost pin). So the running time for Q is 0(p(n) + p{n) ] ° mu) ). 
As pin) logpini dominates pin), we can omit pin) in pin) + p(n) logpln K If the 
degree of pin) is k, then pin) - 0(n k ). So we can replace p(n) by n k . So 
p(nf 0$pw) = 0((n k ) klogir ) = 0(n Mogn ). Hence the running time of Q is 0 (rlos?u 
for some constant c. 
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EXAMPLE 12.5 


Show that P is closed under (a) union, (b) concatenation, and (c) comple¬ 
mentation. 

Solution 

Let L] and L 2 be two languages in P. Let w be an input of length n. 

(a) To test whether w e L x u L 2 , we test whether w e L x . This takes 
polynomial time p(n). If w £ L x , test another w e L 2 . This takes 
polynomial time q(n). The total time taken for testing whether 
w e L { u Li is pin) + q(n ), which is also a polynomial in n. Hence 
L{ u l 2 e P. 

(b) Let >v = X\X 2 . • • x iv For each h 1 < k < n - 1 , test whether x { x 2 . . . x k 
e L x and x k+x x k+2 . . . x n e L y If this happens, w e L X L 2 . If the test 
fails for all k, w £ L { L 2 . The time taken for this test for a particular 
k is p(n) + q{n\ where p{n) and q(n) are polynomials in n. Hence the 
total time for testing for all k' s is at most n times the polynomial 
pin) + q(n). As n(pn) + q(n) is a polynomial, L X L 2 € P. 

(c) Let M be the polynomial time TM for We construct a new TM 
Mi as follows: 

1. Each accepting state of M is a nonaccepting state of M x from 
which there are no further moves. So if M accepts w, M x on 
reading w will halt without accepting. 

2. Let q f be a new state, which is the accepting state of M x . If 
8{q y a) is not defined in M, define 8 M {q, a) = (q^ a , R). So, 
tv g L if and only if M accepts w and halts. Also M x is a 
polynomial-time TM. Hence L{ e P. 


EXAMPLE 12.6 


Show that even 7 language accepted by a standard TM M is also accepted by 
a TM M\ with the following conditions: 

1. M{ s head never moves to the left of its initial position. 

2. Mi will never write a blank. 

Solution 

It is easy to implement Condition 2 on the new machine. For the new TM, 
create a new 7 blank b\ If the blank is written by M, the new Turing machine 
writes b\ The move of this new 7 TM on seeing lV is the same as the move of 
M for b. The new TM satisfies the Condition 2. Denote the modified TM by 
M itself. Define the modified M by 

M = (Q, X, r. S, q 2 , b, F) 

Define a new 7 TM M x as 

M, = (Q h I x {b}, r,. <5,, [b, b}, FO 




Chapter 12: Complexity B 367 


where 

Qi= {q o* qi } {u. L}) 

T\ = (rxr)u {[x, *] \x e T} 

q$ and q x are used to initiate the initial move of M. The two-way infinite tape 
of M is divided into two tracks as in Table 12.4. Here * is the marker for the 
leftmost cell of the low r er track. The state [q, U\ denotes that simulates M 
on the upper track. [q y L\ dentoes that M\ simulates M on the lower track. If 
M moves to the left of the cell with *, M\ moves to the right of the lower 
track. 


TABLE 12.4 Foided Two-way Tape 




x 2 





X-1 

X-2 

X-3 




We can define F { of M x by 

F\ = F x {U, L } 

We can describe 5 as follows: 

1 . 8 { (q 0 , [a, b]) = (q h [a, *], R ) 

S(q i- [X, b\) = ([q 2 . U], [X, b], L) 

By Rule 1, M-. marks the leftmost cell in the lower track with * and 
initiates the initial move of M. 

2. If 8(q, X) - {p, Y, D) and Z e T. then: 

(i) 8y([q, U], [X, Z]) = (Ip, U], [K Z], D) and 

(ii) 8,({q, L], [Z, X]) = ([ p, L], [Z, Yl 5) 
w^here D = L if D = R and D - R if D = L. 

By Rule 2. M x simulates the moves of M on the appropriate track. In 
(i) the action is on the upper track and Z on the lower track is not 
changed. In (ii) the action is on the low 7 er track and hence the 
movement is in the opposite direction D ; the symbol in the upper 
track is not changed. 

3. If S(q. X) = (p t 7, R) then 

8 { ([q, LI [W *]) = <5,([< 7 , £/], [X *]) = ([p, U}, [Y, *], R) 

When Mi see * in the lower track, M moves right and simulates M 
on the upper track. 

4. If 8{q, X) = (p. Y. L), then 

Si([q, L], [X, *]) = 8 x ([q. Ul‘[X, *]) = ([p 9 ZJ. [K *], R) 

When M x sees * in the lower track and M’s movement is to the left 
of the cell of the two-way tape corresponding to the * cell in the 
lower track, the M‘s movement is to X_ { and the M {s movement is 
also to X_\ but tow^ards the right. As the tape of M is folded on the 
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cell with *. the movement of M to the left of the * cell is equivalent 
to the movement of M { to the right. 

M reaches q in F if and only if M\ reaches [ q , L] or [q, R], Hence 
T(M) = T(M X ). 


EXAMPLE 12.7 


We can define the 2SAT problem as the satisfiability problem for boolean 
expressions written as a of clauses having two or fewer variables. Show that 
2SAT is in P. 

Solution 

Let the boolean expression E be an instance of the 2SAT problem having n 
variables. 

Step 1 Let E have clauses consisting of a single variable (xy or 3c/). If (x,*) 
appears as a clause in E y then x l has to be assigned the truth value T in order 
to make E satisfiable. Assign the truth value T to x h Once xy has the truth value 
T } then (x/ v xy) has the truth value T irrespective of the truth value of xy (Note 
that Xj can also be x ;)■ So (x,- v xy) or (xy v xy) can be deleted from E. If 
E contains (3t/ v xy) as a clause, then xy should be assigned the truth value T 
in order to make E satisfiable. Hence we replace (xy v xj) by xj in E so that 
Xj should be assigned the truth value T is order to make E satisfiable. Hence 
we replace (*,■ v Xj) by Xj in E so that xy can be assigned the truth value T 
later. If we repeat this process of eliminating clauses with a single variable (or 
its negation), we end up in two cases. 

Case 1 We end up with (xy-) a (x/). In this case E is not satisfiable for any 
assignment of truth values. We stop. 

Case 2 In this case all clauses of E have two variables. (A typical clause is 
Xj v Xj or X/ V Xj.) 

Step 2 We have the apply step 2 only in Case 2 of step 1. We have already 
assigned truth values for variables not appearing in the reduced expression E. 
Choose one of the remaining variables appearing in E. If we have chosen xy, 
assign the truth value T to x ? % Delete x ? - v xy or xy v x/ from E. If Xj v xj 
appears in E, delete 3c / to get (xy). Repeat step 1 for clauses consisting of a 
single variable. If Case 1 occurs, assign the truth value F for x,- and proceed 
with E that we had before applying step 1. 

Proceeding with these iterations, we end up either in unsatisfiability of E 
or satisfiability of E . 

Step 2 consists of repetition of step 1 at most n times and step 1 requires 
O(n) basic steps. 
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Let n be the number of clauses in E. Step 1 consists of deleting (x; v xj) 
from E or deleting xt from (x/ v xj. This is done at most n times for each 
clause. In step 2, step 1 is applied at most two times, one for x l and the second 
for xr As the number of variables appearing in E is less than or equal to n, 
we delete (x ; - v xj) or delete xi from (x,- v xj) at most 0(n) times while 
applying steps 1 and 2 repeatedly. Hence 2SAT is in P. 


SELF TEST 

Choose the correct answer to Questions 1-7: 

1. If f(ji) = 2n 3 + 3 and g(n) = lOOQOra 2 + 1000, then: 

(a) the growth rate of g is greater than that of /. 

(b) the growth rate of / is greater than that of g. 

(c) the growth rate of / is equal to that of g. 

(d) none of these. 

2. If/(n) = h ? + 4n + 7 and g(n) = 1000n 2 + 10000. then f(n) + g(n) is 

(a) OUr) 

(b) 0 (h) 

(c) <9(n ? ) 

(d) 0(n 5 ) 

3. If/(n) = 0(n k ) and g(n) = 0(n‘), then f(n)g(n) is 

(a) max{^, /} 

(b) k + 1 

(c) kl 

(d) none of these, 

4. The gcd of (1024. 28) is 

(a) 2 

(b) 4 

(c) 7 

(d) 14 

5. F 10.71 + ’9.9] is equal to 

(a) 19 

(b) 20 

(c) 18 

(d) none of these. 

6 . log ; 1024 is equal to 

(a) '8 

(b) 9 

(c) 10 

(d) none of these. 
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7. The truth value of fix. y, z) = Cv v —y) a (—\x v y) a c is T if .r, y, z 
have the truth values 

(a) T. T. T 

(b) F. F, F 

(c) T, F. F 

(d) F. T. F 

State whether the following Statements 8-15 are true or false. 

8. If the truth values of .v, y. ; are T. F. F respectively, then the truth value 
of/(.r, y, z) = x a — i(y v z) is T. 

9. The complexity of a A-tape TM and an equivalent standard TM are the 
same. 

10. If the time complexity of a standard TM is polynomial, then the time 
complexity of an equivalent /.'-tape TM is exponential. 

11. If the time complexity of a standard TM is polynomial, then the time 
complexity of an equivalent NTM is exponential. 

12. fix. v. z) = (x v v v ;) a (—i .y a —i y a —i z) is satisfiable. 

13. f(x. y. ;) = (x v y) a (—. _v a y) is satisfiable. 

14. If / and g are satisfiable expressions, then / v g is satisfiable. 

15. If/and g are satisfiable expressions, then / a g is satisfiable. 


EXERCISES 

12.1 If fin) = 0(n k ) and gin) = Off), then show that fin) + g(n ) = Off) 
where t = max{A, /} and f(n)gin) = 0(n k+l ). 

12.2 Evaluate the growth rates of (i) fin) = 2 n~. (ii) gin) = 10/r + In log n + 
log n. (iii) hin) = if log n + 2 n log;; + In + 3 and compare them. 

12.3 Use the O-notation to estimate (i) the sum of squares of first n natural 
numbers, (ii) the sum of cubes of first n natural numbers, (iii) the sum 
of the first n terms of a geometric progression whose first term is a and 
the common ratio is r, and (iv) the sum of the first n terms of the 
arithmetic progression whose first term is a and the common difference 
is d. 

12.4 Show that fin) = 3/r log : n + 4n log-, n + 5 log 2 log 2 n + log 72 + 100 
dominates nr but is dominated by n'. 

12.5 Find the gcd (294. 15) using the Euclid's algorithm. 

12.6 Show- that there are five truth assignments for (P, Q, R ) satisfying 
P v (—1 Pa —1 Q a R). 
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12.7 Find whether (P a Q a R) a — ■ Q is satisfiable. 

12.8 Is f(x, v, 4 w) = (jc v y v z) a (Ivy vz) satisfiable? 

12.9 The set of all languages whose complements are in NP is called 
CO-NP. Prove that NP = CO-NP if and only if there is some 
jVP-complete problem whose complement is in NP. 

12.10 Prove that a boolean expression E is a tautology if and only if -i E is 
unsatisfiable (refer to Chapter I for the definition of tautology). 



Answers to Self-Tests 



Chapter 1 

1 . (d) 2 . (a) 

5. (b) 6. F 


Chapter 2 


1. (b) 

2. 

(c) 

5. (c) 

6. 

(b) 

9. (d) 

10. 

(d) 

Chapter 3 

1. (d) 

2. 

(a) 

5. (d) 

6. 

T 

9. T 

10. 

F 

13. F 

14. 

T 

Chapter 4 

1. (b) 

2. 

(d) 

5. (b) 

6. 

(a) 

9. (a) 

10. 

(b) 

13. F 

14. 

T 

17. F 

18. 

T 

Chapter 5 

C (a) 

2. 

(d) 

5. (a) 

6. 

(d) 

9. (d) 

10. 

(a) 

13. F 

14. 

T 

17. F 


3. 

(c) 

4. 

(c) 

7. 

F, T, F; 

F, T, T 8. 

r 

3. 

(a) 

4. 

(b) 

7. 

(c) 

8. 

(d) 

3. 

(d) 

4. 

(a) 

7. 

F 

8. 

T 

11. 

T 

12. 

T 

15. 

F 



3. 

(c) 

4. 

(a) 

7. 

(d) 

8. 

(a) 

11. 

(c) 

12. 

(b) 

15. 

F 

16. 

T 

19. 

F 

20. 

F 

3. 

(a) 

4. 

(d) 

7. 

(b) 

8. 

(b) 

11. 

F 

12. 

F 

15. 

F 

16. 

T 
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374 B Answers to Self-Tests 


Chapter 6 


1. (a) A 


(b) 

Yes 

(c) 

Yes 



(e) Lables for nodes 1-14 are A, 

b , A, A, A, fc, a. 

a, A, 

A, b, 

a , 

a. 







2. (a) F 


(b) 

T 

(C) 

T 

(d) 

F 

(e) T 








3. (a) T 


(b) 

T 

(C) 

T 

(d) 

F 

(e) F 


(f) 

T 

(g) 

F 

(h) 

T 

Chapter 7 








1 . (a) 


2 . 

(b) 

3. 

(d) 

4. 

(a) 

5. (c) 


6 . 

(a) 

7. 

input string to S 



8 . looking ahead by one symbol 





9. (q fi a, 

a) for 

some 

■ q* e F and a 

e r* 



H-i 

© 

s\ 

> 

a) for 

some 

state q. 





Chapter 8 








1 . (c) 


2 . 

(a) 

3. 

(a) 

4. 

(c) 

5. (b) 








Chapter 9 








1 . (d) 


2 . 

(b) 

3. 

(a) 

4. 

(d) 

5. (a) 


6 . 

(c) 

7. 

(b) 

8 . 

(d) 

9. (b) 


10. 

(b) 





Chapter 11 








1 . (a) 


2 . 

(c) 

3. 

(a) 

4. 

(b) 

5. (b) 


6 . 

(a) 

7. 

(c) 

8. 

(d) 

9. (a) 


10. 

(b) 

11. 

T 

12. 

T 

13. F 


14. 

T 

15. 

T 



Chapter 12 








1 . (b) 


2. 

(c) 

3. 

(b) 

4. 

(b) 

5. (a) 


6 . 

(c) 

7. 

(a) 

8. 

T 

9. F 


10. 

F 

11. 

T 

12. 

F 

13. T 


14. 

T 

15. 

F 





Solutions (or Hints) to 
Chapter-end Exercises 


Chapter 1 

1.1 All the sentences except (g) are propositions. 

1.2 Let L, E, and G denote a < b, a = b and a > b respectively. Then 
the sentence can be written as 

( E a —s G a —! IS) v (G a —i E a —i IS) v (X a ~i E a —i G). 

1.3 (i) David gets a first class or he does not get a first class. Using the 
truth table given in Table Al.l,v is associative since the columns 
corresponding to (P v Q) v R and P v (Q v R) coincide. 


TABLE A1.1 Truth Table for Exercise 1.3 


p 

Q 

R 

P v Q 

O 

<1 

T) 

(P v Q) v R 

P v (Q v R) 

T 

T 

T 

F 

F 

7 

7 

T 

T 

f 

F 

T 

F 

F 

T 

F 

T 

T 

T 

F 

F 

T 

F 

F 

T 

F 

7 

7 

F 

T 

T 

T 

F 

F 

F 

F 

T 

F 

7 

T 

7 

7 

F 

F 

T 

F 

T 

7 

7 

F 

F 

F 

F 

7 

F 

F 


The commutative and distributive properties of Exclusive OR can be 
proved similarly. 

1.4 Using v and —■, all other connectives can be described. P a Q and 
P => Q can be expressed as ~i (—i P v -i Q) and -iPvg respectively. 

1.5 -x P = P ^ P 

P A Q = (P T Q) T (P T Q) 

P v Q = (P T P) t (Q T 0 

Verify these three equations using the truth tables. 
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1.6 

1.7 

p = p i p 

P V Q = (P i Q) 1 (P i 0 

P a Q = (P i P) i (Q i Q) 

(a) The truth table is given in Table A1.2. 

TABLE A1.2 Truth Table for Exercise 1.7(a) 


P Q 

R P v Q P v R 

RvQ P v R => R v Q PvQ=>((P 

vR)^(RvQ)) 

T T 

T T T 

T T 

T 

T T 

F T T 

T T 

T 

T F 

T T T 

T T 

T 

T F 

F T 7 

F F 

F 

F T 

T T T 

T T 

T 

F T 

F T F 

T T 

T 

F F 

TFT 

T T 

T 

F F 

F F F 

F T 

T 

1.8 

(a) —i P => (—i P a 

Q) = 1 ( 1 P) v ( i P A Q) 

by / 12 



m P v (-, P a Q) 

by h 



= (P V —1 P) A (P V Q) 

by / 4 



S T A CP v 0 

by / 8 



= P V Q 

by h 


—, P => (-, P => (-, P A 0) S —. (—, P) V (P V 0 

by 1 12 



= fv(Pv0 

by / 7 



= p v e 

by / 3 and /; 

1.9 

We prove / 5 and / 6 using the truth table. 



TABLE A1.3 Truth Table for Exercise 1.9 


P 

Q P a Q —r P 

' -i Q Pv(PaQ) -i (P a Q) —iPv—,Q 

T 

T T F 

FTP 

F 

T 

F F F 

T T T 

T 

F 

TFT 

F F T 

T 

F 

F F T 

T F T 

T 


P v (P a Q) = P since the columns corresponding to P and 
P v CP a Q) are identical; —i (P a © = -p v —j g is true since the 
columns corresponding to —» (P a 2) and n? v ng are identical. 

1.11 We construct the truth table for (P =$ — ! P) => —i P. 


TABLE A1.4 Truth Table for Exercise 1.11 


P 

nP 

p => ^p 

(P => -,F) => -tF 

T 

F 

F 

T 

F 

T 

T 

T 


As the column corresponding to (P => —i P) —■ P has P for all 

combinations, (P => -i P) => —i P is a tautology. 
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1.12 Pa(P=>-i0sPa(-,Pv-.0s(Pa-iP)v(Pa 
F v (P a —i Q) = P a —\ Q 

so (p a (p => 0) v (e => 0 s (p a q) v (-, e v 

(P a —i 0 v -i (2 = —i <2- Hence 


((P A (P 

=> -> 0) V 

(Q => -i 

0) => 

->Q ; 

^ (-. Q => ■ 

-0S 

V —1 

Q * 

= e' 

* ->Q ' 

= T 






1.13 a = 

(Q 

A —i 

R A — i 

5) v (P 

A 

5) 




= 

(Q 

A —i 

R A — i 

5) v (P 

A 

S A 

Q) v 

(P A S A 


= 

(!Q 

A —i 

R A — i 

5) v (<2 

A 

P A 

S) v 

(-1 Q A P 

A »S) 


1 0 - 
i0 = 
(i 0 


1.14 Let the literals be P, (A P- Then 
a h 110 v 100 v 010 v 000 

s ((P A Q A P) V (P A Q A P)) V (010 V 000) 

= (P A —i P) V (—1 P A Q A —1 P) V (—1 P A —I (2 A —1 P)) 

= (P A — i P) V (—i P A — I P) 

3 -iP 


1.15 


(a) The given premises are: (i) P => Q, (ii) R => —\ Q. To derive 
P => -i P. we assume (iii) P as an additional premise and deduce —i R. 

1. P Premise (iii) 

2. P => Q Premise (i) 


3. Q 

4. - (-. 0 

5. R 

6. —i R 

7. P => —i P 


Rh 

h 

Premise (ii) 
P/ 5 

Lines 1 and 6 


Hence the argument is valid. 


(b) Valid 


(c) Let the given premises be (i) P 9 (ii) Q . (iii) -i Q =$ R, 
(iv) Q -^R. Then 
L g Premise (ii) 

2. i R Premise (iv) 

Hence the given argument is valid. 


(d) Let the given premises be (i) 5, (ii) P, (iii) P Q a R, 
(iv) e v S ^ r. Then 

1. P Premise (ii) 

2. Q a R Premise (iii) 

3. Q RI 3 

4. Q v 5 fl/j 

5. 7 Premise (iv) 

Hence the argument is valid. Note that in (c) and (d) the conclusions 
are obtained without using some of the given premises. 



378 B Solutions (or Hints) to Chapter-end Exercises 


1.16 We name the propositions in the following way: 
R denotes ‘Ram is clever’. 

P denotes 4 Prem is well-behaved’. 

J denotes 4 Joe is good’. 

S denotes 4 Sam is bad’. 

L denotes 4 Lal is educated’. 


The given premises are (i) R => P, (ii) J => S a —■ P, (iii) L ^ 
J v P. We have to derive LahP^S. Assume (iv) L a —■ P as an 
additional premise. 


1 . L a -iP 

2 . L 

3. —i P 

4. J v R 

5. R ^ P 

6. nP 

7. 7 

8. S a —i P 

9. 5 

Hence, L a 


Premise (iv) 

P / 3 

Rh 

Premise (iii) 

Premise (i) 

Line 3, Premise (i) and P/ 5 
Lines 5 and 4 and P/ 6 
Premise (ii) 

Rl 3 

-.P => 5. 


1.17 The candidate should be a graduate and know Visual Basic, JAVA and 
C++. 


1.18 {5, 6, 7, . . 

1.19 Let the universe of discourse be the set of all complex numbers. Let 
P(x) denote 4 x is a root of r + at + b = O’. Let a and b be nonzero 
real numbers and b ^ 1. Let P(x) denote x is a root of r + at + b 
= O’. Let Q(x) denote x is a root of bt + at + 1 = O’. If x is a root 
of r + ar + b - 0 then Vx is a root of br + ar + 1= 0. But x is a 
root of t 2 + at + £> = 0 as well as that of br + at + 1 = 0 only when 
x = ±1. This is not possible since b & 1. So, 3x(P(x) => Q(x)) <^> 
(3x P(x) 3x Q (x)) is not valid. 

1.20 Similar to Example 1.22. 

1.21 Let the universe of discourse be the set of all persons. Let P(x) denote 
‘x is a philosopher’. Let Q(x) denote x is not money-minded’. Let 
R(x) denote ‘x is not clever’. Then the given sentence is 

(Vx (P(x) => Q(x))) a (3x(-i Q(x) a R(x))) => (3x(^ P(x) a R(x))) 

1. -i Q(c) a R(c) Rl u 

2. g(c) RI 3 

3. —iP(c) P/ 5 

4. P(c) Line I and P/ 3 

5. -i 2( c ) A R( c ) Lines 2 and 4 
Hence the given sentence is true. 

1.22 Similar to Exercise 1.21. 
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Chapter 2 

2.1 (a) The set of all strings in { a , b , c}*. 

(b) A n £ = {£}. Hence (A n 5)* = {^ /z 1 77 > 0}. 

(c) The set of all strings in { a , b, c}* which are in {a, Z?}* or in 
{ b , c}*. 

(d) A* n £* = > 0}. 

(e) A - B = {a}. Hence (A - 5)* - {d"|n > 0}. 

(f) (B - A)*.= {c 1l \n > 0}. 

2.2 (a) Yes 

(b) Yes 

(c) Yes. The identity element is A. 

(d) o is not commutative since x o y ^ y o x when x = ab and 
y - ba ; in this case x o y = abba and j o 1 = baab . 

2.3 (a) o is commutative and associative, (b) 0 is the identity element 

with respect to o. (c) A u B = A u C does not imply that B = C. 
For example, take A = { a , &}, 2? = {b, c} and C = {c}. Then A u 
5 = A u C = {a, b 9 c}. Obviously, B & C. 

2.4 (a) True. 1 is the identity element. 

(b) False. 0 does not have an inverse. 

(c) True. 0 is the identity element. 

(d) True. 0 is the identity element. The inverse of A is A c . 

2.5 (c) Obviously, mRm. If mRn , then m - n = 3 a. So, n - m = 3(-a). 

Hence n/?m. If mi?/? and then m - n - 3 a and n - p = 3b. 

m - p = 3(a + i), i.e. 

2.6 (a) 2? w not reflexive. 

(b) 2? is neither reflexive nor transitive. 

(c) 2? is not symmetric since 22?4, whereas 42^2. 

(d) A is not reflexive since 12?'l (1 + 1 ^ 10). 

2.7 An equivalence class is the set of all strings of the same length. There 
is an equivalence class corresponding to each non-negative number. 
For a non-negative number /?, the corresponding equivalence class is 
the set of all strings of length n. 

2.8 R is not an equivalence relation since it is not symmetric, for example, 
abRaba , whereas abaR'ab. 

2.9 R = {(1, 2), (2, 3), (1, 4), (4, 2), (3, 4)} 

R 2 = {(1, 3), (2, 4), (1, 2), (4, 3), (3, 2)} 

R 3 = {(1. 4), (2, 2), (1, 3), (4, 4), (3. 3)} 

R 4 = {(1, 2), (2, 3), (1, 4), (4, 2), (3, 4)} = R 

Hence 

R + = R kj R 2 u R 3 
R* = u {(1, 1)} 

2.11 R + = R* = R, Since R 2 = R (an equivalence relation is transitive). 
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2.12 Suppose/(x) = /(y). Then ax = ay\ So, x = y. Therefore, / is one-one. 
f is not onto as any string with b as the first symbol cannot be written 
as fix) for any x e {a, b}*, 

2.14 (a) Tree given in Fig. 2.9. 

2.15 (a) Yes. 

(b) 4. 5, 6 and 8 

(c) L 2, 3 and 7 

(d) 3 (The longest path is 1 —> 3 —> 7 —> 8) 

(e) 4- 5-6-8 

(f) 2 

(g) 6 and 7 

2.16 Form a graph G whose vertices are persons. There is an edge 
connecting A and B if A knows B. Apply Theorem 2.3 to graph G. 

2.17 Proof is by induction on |X|. When |X| = 1. X is a singleton. Then 
2 X = {0. X}. There is basis for induction. Assume \2 X \ = 2^ when 
X has n - 1 ‘elements. Let Y = {a^ a 2 , . . a n } 

Y - X u where X = {aj. a 2 , . . .. a H _|}. Then X has n - 1 
elements. As X has n - 1 elements. |2 X | = 2 [A1 by induction hypothesis. 

Take any subset Y { of Y. Either Y\ is a subset of X or Y { - {a„} 
is a subset of X. So each subset of Y gives rise to two subsets of X. 
Thus. |2 } ] = 2|2 X |. But |2 X | = 2 ix \ Hence |2 } ] = 2 |y L By induction the 
result is true for all sets X. 

2.18 (a) When n ~ 1. I 2 = - = 1. Thus there is basis for 

6 

induction. Assume the result for n - 1. Then 

n n-1 

k -1 k =1 

(/? - 1 )(;? - 1 + 19 ( 2 /? - 1 ) -> 

- ---g— -+ rC [by induction hypothesis] 

77(72 + 1 )( 2 « + 2 ) . ... . 

= —-- on simplification. 

6 

Thus the result is true for n. 

(c) When /? = 2, 10 2/! - 1 = 9999 which is divisible by 11. Thus 
there is basis for induction. Assume 10 2(?M) - 1 is divisible by 11. 
Then. 10 ; " - 1 = lO'lO 3 " " n - 1 = 10 2 [10 2 °'' 11 - 1] + 10 : - 1. As 
10 _l n—11 - 1 and 10" - 1 are divisible by 11, TO 2 " - 1 is divisible by 
11. Thus the result is true for 77 . 
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2.19 (b) 2? > 2 [Basis for induction]. Assume 2 n ~ 1 > n - 1 for n > 2. 
Then, 2” = 2 ■ T ~ 1 > 2 (n - 1), i.e. 2 n > n + n - 2 > n (since 
n > 2). The result is true for n. By the principle of induction, the 
result is true for all n > 1. 

2.20 (a) When n = 1, F(2n + 1) = F( 3) = F(l) + F( 2) = F(0) + F(2). So, 

i 

F(2n + 1) = 2 F(2*). 

A=Q 

Thus there is basis for induction. Assume 

F(2n - 1) = ^ F(2fc) [by induction hypothesis] 

*=o 

F(2t 7 + 1) = F(2n - 1) + F(2n) [by definition] 

By induction hypothesis, 

n-] n 

F(2n + 1) =y F(2k) + F(2n) = ^ F(2Jfc) 

A=0 . A'=0 

So the result is true for n. 

2.21 In a simple graph, any edge connects two distinct nodes. The 
number of ways of choosing two nodes out of n given nodes is 

”C, - n{ n ~ ^ . So the maximum number of edges in a simple graph 
2 

. 77(72 - 1 ) 

is -—. 

2.22 We prove by induction on |w|. When w = A, we have abA = A ab. 
Clearly, |A| = 0, which is even. Thus there is basis for induction. 
Assume the result for all w with |w| < n. Let w be of length n and 
abw — wab. As abw = wab , w = abw\ for some w in { a , b }*. So 
ababwy = abw^ab and hence abw j = w x ab. By induction hypothesis, 
\w[\ is even. As \w\ = \wy\ + 2, |w| is also even. Hence by the principle 
of induction, the result is true for all w. 

2.23 Let P(n) be the ‘open the 77th envelope*. As the person opens the first 
envelope, F(l) is true. Assume Pin - 1) is true. Then the person 
follows the instruction contained therein. So the 72th envelope is 
opened, i.e. P(n) is true. By induction, P(n) is true for all n. 

Cnapter 3 

3.1 101101 and 000000 are accepted by M. 11111 is not accepted by M. 

3.2 {go, g 4 }. Now, 8(q 0 , 010) = {q 0 , q 3 } and so 010 is not accepted 
by M. 
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3.3 Both the strings are not accepted by M. 

3.4 As 8(q u a) = 8(q h a ), R is reflexive. Obviously it is symmetric. 
If q\Rq 2 , then <5(# b a) = 5(^ 2? a)- 

If q 2 Rq 2 then S(q 2 , a) = <5(g 3 , a). Thus 8{q x , a) = <5(g 3 , a), implying 
that q\Rq 2 ‘ So is an equivalence relation. 

3.5 The state table of NDFA accepting {ab, ba} is defined by Table A3.1. 


TABLE A3.1 State Table for Exercise 3.5 


State/S 

a 

b 

<7o 

Qi 

q 2 

9i 


Q3 

q 2 

Qz 





The state table of the corresponding DFA 

is defined by Table A3.2. 

TABLE A3. 2 State Table of DFA for Exercise 3.5 

State/l 

a 

b 

fool 

t<7ii 

fa 2 i 

toil 

0 

Iasi 

[<fc] 

fad 

0 


0 

0 

0 

0 

0 

The NDFA accepting the 

given set of 

strings is described by 

Fig. A3.1. The corresponding state table is defined by Table A3.3. 


a, b 



TABLE A3.3 State Table for Exercise 3.6 


State!! 

a 

b 

Qo 

Qo> 9l 

Qo 



Q2 

Q2 

Qz 


Qz 




The DFA accepting the given set is defined by Table A3.4. 
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TABLE A3.4 State Table of DFA for Exercise 3.6 


State/I. 

a 

b 

too] 

too. <7i] 

toil 

too, <7i] 

too, <7il 

too, Q 2 I 

too, QA 

[Qo, <7i, %] 

too] 

too, <7i, <?3) 

[Qo, <7i! 

too, qA 


3.7 The state table for the required DFA is defined by Table A3.5. 


TABLE A3.5 State Table for Exercise 3.7 

State 

0 

1 

2 

tool 

[<7i Q4] 

toJ 

to2, <7d 

[q 4 ] 

0 

0 

0 

toi, Qd 

0 

to 4 ] 

0 

to2, <73] 

0 

[<74] 

[Q2. <fel 

0 

0 

0 

0 


3.9 The state table for the required DFA is defined by Table A3.6. 


TABLE A3.6 State Table for Exercise 3.9 


State 

0 

1 

toil 

[<72, <73] 

toi] 

[<72, <73] 

toi, qA 

toi. <7 2 ] 

toi, <72] 

toi, <72, <73] 

toi] 

toi, <72, <73] 

toi, <72, <73] 

toi, <72] 


3.10 The required transition system is given in Fig. A3.2. Let X denote 
{a, b, c, . .z}. * denotes any symbol in X - {c, r}. ** denotes any 
symbol in X - {c, a, r}. *** denotes any symbol in X. 


c 



r 


Fig. A3. 2 Transition system for Exercise 3.10 

3.11 The corresponding Mealy machine is defined by Table A3.7. 




384 fi Solutions (or Hints) to Chapter-end Exercises 


TABLE A3. 7 Mealy Machine of Exercise 3.11 

Present state 



Next 

state 



a = 0 



a = 1 



state 

output 


state 

output 

9o 

9i 

0 


92 

1 

9i 

93 

1 


92 

1 

92 

92 

1 


9i 

0 

93 

9o 

1 


93 

1 

q\ is associated with 1 and c/ 2 

is associated with 0 and 1. Similarly, t/ 3 

is associated with 0 and 1, whereas 

Cl 4 is 

associated with 1. 

. The state 

table with new states CJ 20 , <? 2 i 

, < 730 , 93 i and g 4 is defined by Table A3.8. 

TABLE A3. 8 State Table for Exercise 3.12 


Present state 



Next 

state 



a = 0 



a = 1 



state 

output 


state 

output 

9i 

9i 

0 


920 

0 

920 

94 

1 


94 

1 

921 

94 

1 


94 

1 

930 

921 

1 


931 

1 

931 

921 

1 


931 

1 

94 

930 

0 


9i 

1 


The revised state table is defined by Table A3.9. 


TABLE A3.9 Revised State Table for Exercise 3.12 


Present state 


A/ext state 


a = 0 

a = 1 

output 

-» 9o 

9i 

920 

0 

9i 

9i 

920 

1 

920 

94 

94 

0 

921 

94 

94 

1 

930 

921 

931 

0 

931 

921 

931 

1 

94 

930 

9i 

1 


3.13 The Mealy machine is described by Fig. A3.3. 


0, Even 0, Odd 



1, Even 


Fig. A3. 3 Mealy machine of Exercise 3.13. 
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3.14 Ki s are given below: 

^ = {{<7o}> {<70, <?!> <?2, <?3, <?4, <7s}} 

= {{<7e}> {<7o, <?i, <72, <73, ^?5}’ {^4}> 

^2 = {{<76}, {44}, {<7o, <7i> <73}, {<72- <75}} 

^3 = {{%}, {^4}, {<7o}, t^iK {<73}, {<72, <75}} 

^4 = {{%}, {<74}, {< 7 oh {< 7 i}. {<73}, (<72}, {<75}} 

Here n - Q. The minimum state automaton is simply the given 
automaton. 



Chapter 4 

4.1 (a) S A> 0 n Sl n =e> 0"0" Z A1 W T\ n > 0, m > 1. 

A ^ 1*A => l* + \ k > 0. 

0 /? T 2 e L(G) when n > m > 1. So L(G) = {0 m T* : n > m > 1} 

(b) L(G) = {0"T J | m ^ n and at least one of m and n > 1}. Clearly, 
0 m e L(G) and l n e L(G), where m, n > 1. 

For m > n, 5 ^ 0"S1" => 0”0A1" =* 0 n 00 m ~ n ~ l l n = (TIT Thus 
0 m r e L(G). 

(c) L(G) = {0 n l /J 0 /2 1 n > 1}. The proof is similar to that of 
Example 4.10. 

(d) L(G) = {0T'Tl H |m, n > 1}. 

For ra, n > 1, 

5 r4> o" -1 Sl' !-1 => 0' ! “'0AH"- ! => 0"l" i_1 A0'" _1 l' ,_1 =* 0"l'"0"'l" 
So, 

{OhTT^i, n > 1} c L(G). 

It is easy to prove the other inclusion. 

(e; L(G) = {x e {0, 1} + j x does not contain two consecutive 0’s} 

4.2 (a) G = ({5 1 , A, 5}, {0. 1}, P, S), where P consists of S —> OP | 1A, 
A -> 0 | OS | 1AA, 5 -> 1 \1S\ 0BB. 

Prove by induction on \w |, w e X*, that 

(i) S w if and only if w consists of an equal number of 0’s and l’s 

(ii) A => w if and only if w has one more 0 than it has l’s. 

(iii) B ==> w if and only if w has one more 1 than it has 0’s. 

A => 0, B =$ 1 and S does not derive any terminal string of length one. 
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Thus there is basis for induction. Assume (i), (ii) and (iii) are true 
for strings of length k - 1. Let w be a string in X* with \w\ - k. 
Suppose 5 => w. The first step in this derivation is either 5 => 05 or 
5 =$ 1A. In the first case w = 0w 1? 5 w { and \w { \ ~ k - 1. By 
induction hypothesis, w has one more 1 than it has 0’s. Hence w has 
an equal number of 0’s and l’s. To prove the converse part, assume 
w has an equal number of 0’s and l’s and |w| = k. If w starts with 
0, then w = Ou^ where \w { \ = k - 1. vtq has one more 1 than it has 
0’s. By induction hypothesis, 5 A Wj. Hence 5 OB => Onq = w. 
Thus (i) is proved for all strings w. The proofs for (ii) and (iii) are 
similar. 

(b) The required grammar G has the following productions: 

5 051, 5 -> 0A1, A LAO, A 10. 

Obviously, L(G) c {Q n V n 0 m l n \m, n > 1}. For getting any terminal 
string, the first production is 5 —» 051 or 5 0A1, the last 

production is A 10. By applying 5 051 (n - 1) times, 5 —» 0A1 

(once), A 1A0 (m - 1) times, and A -» 10 (once) we get 0 n l m 0 m V\ 
So, {0'T ?, 0"T | m, n > 1} c 1(G). 

(c) The required productions are 5 —» 05111Oil. 

(d) The required productions are 5 —> 0A11 150, A —> 0A11 A, B —> 
ISO | A. 

(e) Modify the constructions given in Example 4.7 to get the 
required grammar. 

4.3 For the derivation of 001100, 001010 or 01010, the first production 
cannot be 5 —» 051. The other possible productions are 5 —» 0A and 
5 —> 15. In these cases the resulting terminal strings are 0 n or 1". So 
none of the given strings are in the language generated by the 
grammar given in Exercise 4.1(b). 

4.4 It is easy to see that any derivation should start with 5 => 0A5 
0A5A => 0A0A5A or 5 => 0A5 => 0A01 => 05051. If we apply 
5 —» 0A5, we get A in the sentential, form. If we try to eliminate A 
using A0 —» 505 or A1 —» 551, we get 5 in the sentential form. So 
one of the two variables, namely A or 5, can never be eliminated. 

4.5 The language generated by the given grammar is {01 m 2 /2 3| m, n > 1}. 
This language can also be generated by a regular grammar (refer to 
Chapter 5). 

4.6 (a) False. Refer to Remark 2 on page 123. 

(b) True. If L = {w b w 2 , . . w,J, then G = ({5}, X, 5, 5), where 

P consists of 5 —> W]|w 2 | . . . \w n . 

(c) True. By Theorem 4.5 it is enough to show that {w} is regular 
where w e X*. Let w = aja 2 . . . a n . Then the grammar whose 
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productions are S —> a x A 1? A x —> ghAo, . . . A m _ x —> a m generate 

M- 

4.7 We prove (a) S => Al'AgA^ (b) A{ l A"A 4 => a n2 A"A 4 , (c) A"A"A 4 => 

a 2;z+1 . 

We first prove (a). 5 => A 3 A 4 A!A 3 A 2 A 4 Af^A^T - ^ 
A 1 ;z_ 1 A 1 A 2 A 2 i ~ 1 A 4 (We are applying A 3 —> AjAyUfa - 2) times and 
A 3 -» AiA 2 once.) Hence (a). To prove (b) start with A/ Z A 2 A 4 . Then 
A" ^AjA 2 A 2 l ! A 4 => A/* i aA 2 A 1 A 2 rt ^A 4 A{ 1 2 #AiA 2 A]A 2 z ^A 4 => 

Af“VA 2 A 1 A 1 A 2 ' a A 4 =4> A 1 / ^ 2 (rA 2 A 1 ^ 2 ^i^ 2 l_2 ^4 =S>Af~ 2 a 3 A 2 A I A 2 A 1 A 2 '' 2 A 4 => 
A/ z " 2 « 3 A 2 aA?A 1 A 1 A 2 “ 2 A 4 =^> a 4 A ( l ~ 2 AfA 1 2 A 2 / 2 ” 2 A 4 . 

2 

Proceeding in a similar way, we get AfA 2 A 4 =5> a^AoA^A^ 
Hence (b). 

Finally, A 2 A/A 4 => A 2 A 1 /z "’ 1 A 4 a ==> A 2 A 4 <P => A 2 _ 1 A 5 a 7I+1 => A 5 a 2 " 
^ £ 5 T /i+ *. (We apply AjA 4 —> A^ci. A 2 A 4 —> A 5 #, A 2 A 5 —> A 5 # and 
finally A 5 a). 

Using (a), (b) and (c), we get S => a in ^ ])2 . 

4.8 The productions for (i) are S —> aS\B, aS —> aa, B —> a. For (ii) the 
productions are S -» AS | a, A —> a. For (iii) the productions are 
5 aS | a. 

4.9 The required grammar G = ({5, 5 l? A, B }, X, P, 5), where X = 
{0, 1, 2, . . ., 9} and P consists of 

S -> 0 | 2 | 4 | 6 | 8, 5 ^ A5 b A —> 1 | 2 [ .. . | 9 
Si —> 0 | 2 | 4 | 6 | 8, *5 —^ ABS U B -> 1 | 2 | .. . | 9 
iS —> 0 j 2 | 4 | 6 | 8 generate even integers with one digit. 

S —> ASi and A-productions and Si -productions generate all even 
numbers with two digits. The remaining productions can be used to 
generate all even integers with three digits. 

4.10 (a) G = ({5, A, S}, {0, 1}, P, 5), where P consists of 5 -> 051 | 0A 
| 15 ] 0 | 1, A -> 0A | 0, B -» IB | 1. Using 5 -» 051, 5 0, 

5 —» 1, we can get 0'T, where m and differ by 1. To get more 0’s 
(than l’s) in the string we have to apply A —» 0A | 0, 5 -> 0A 
repeatedly. To get more Ps, apply 5 —» IS, B —> IB | 1 repeatedly. 

(b) The required productions are 5 —> aS 1 , 5i —» bS x c , 5j —» &c, 

5 —^ fl5 2 c, 5 2 —^ a5 2 c, 5 2 —^ b, 5 —> 5 3 c, 5 3 —> aS^b, 5 3 —> ab. The 

first three productions derive ab n c n . 5 —> a5 2 c and the 5 2 -productions 
generate a n bc 11 . The remaining productions generate a'TPc. 

(c) The required productions are 5 051, 5 —» 01, 5 —» 0A1, A —» 

1A, A —» 1. 

(d) The required productions are 5 —> «5c, 5 —> ac, 5 —> &c, 5 -» 
bS x c , 5j —> W^c, 5i —» Z?c. A typical string in the given language can 
be written in the form a l b m c 1J1 c l where 1, m > 0. 5 —» a5c, S be 
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generate a l c l for 1 > 1. 5 —» bS x c, 5j —» £>5iC, S x —> be generate fc'V" 
for m > 1. For getting a^'VV, we have to apply 5 —» aSc l times; 
5 -> be, 5 —> bS x c , 5] —> i>5ic and S x —> Z?c are to be applied. For 
m = 1, 5 —» be has to be applied. For m > 1, we have to apply 
5 —» Wqc, Sj —» and 5\ —» repeatedly. The terminal c is added 
whenever the terminal a or b is added in the course of the derivation. 
This takes care of the condition 1 + m = n. 

(e) Let G be a context-free grammar whose productions are 
5 5051505, 5 -» 5050515, 5 -» 5150505, S A. It is easy to see 

that elements in L(G) are in L. Let w e L. We prove that w e L(G) 
by induction on jw|. Note that every string in L is of length 3 n, 
n > 1. When \w\ = 3, h j has to be one of 010, 001 or 100. These 
strings can be derived by applying 5 —» 5051505, 5 —» 5050515 and 
5 —» 5150505 and then 5 A. Thus there is basis for induction. 
Assume the result for all strings of length 3 n - 3. Let w e L and let 
|w| = 3n, w should contain one of 010, 001 or 100 as a substring. Call 
the substring w x . Write w = w 2 w x w 2 . Then |w 2 w 3 | = 3n - 3 and by 
induction hypothesis 5 => w 2 w 3 . Note that all the productions (except 
5 —» A) yield a sentential form starting and ending with 5 and having 
the symbol 5 between every pair of terminals. Without loss of 
generality, we can assume that the last step in the derivation 
5 =E> w 2 w 3 is of the form w 2 5w 3 => w 2 w 3 . So, 5 => w 2 5w 3 . But 
W) e L and so 5 => w x . Thus, 5 =e> w 2 w x w 3 . In other words, w e L(G). 
By the principle of induction, L = L(G). 

4.11 The required productions are: 

(a) 5 —^ ciS\. S x —^ ciS , 5 —^ aS 2 ? 5 2 —^ ci 

(b) 5 —^ aS, 5 —^ F5, 5 —^ # 

(c) 5 —^ n5 1 , 5i —^ {25^, 5^ —^ £?5], 5] —^ n, 5^ —^ 

(d) 5 —^ <35^, 5] —^ «5i, 5^ —^ £>5 2 , 5 2 —^ £>5 2 , 5 3 —^ c5 3 , 5 3 —> c 

(e) 5 —^ n5j. 5i —^ Z?5, 5 —^ #5 2 , 5 2 —^ 

4.12 is not symmetric and so the relation is not an equivalence relation 
(Refer to Note (ii), page 109) 

4.13 It is clear that L(G X ) = [d'b" | n > 1}. In G 2 , S => AC => A5£ 

A "” 1 SB n ~\ Also, S =$ AB => ab. Hence S => d'b n for all n > 1. This 

means LyGj) = {«"&" | n > 1 } = 

4.14 L(G) = 0, for we get a variable on the application of each production 
and so no terminal string results. 

4.15 The required productions are 5 —> aS x , S x —> bS 2 , S 2 c, 5 —> Z?5 3 , 

5 3 —^ c*54, 54 —^ a, 5 —^ c5^, 55 —> ciSfa b(s —^ 

4.16 The required productions are 5 —> 5i, 5i —> abS Xi S x —> ab , 5 —> 5 2 , 
5 2 —^ baS 2 , 5 2 —^ 
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4.17 Let the given grammar be Gj. The production A ^ xB where 
x = a { a 2 ... a n is replaced by A —> a x A x , A x —> a 2 A 2 , . . ., A„_i —> 
The production A —» y, where y = . . . b m is replaced by 

A —> B ] —> b 2 B 2 , . . —> & m . The grammar G 2 whose 

productions are the new productions is regular and equivalent to G\. 

Chapter 5 

5.1 (a) 0 + 1 + 2 

(b) 1(11)* 

(c) w in the given set has only one a which can occur anywhere in 
w. So w = xay\ where x and y consist of some b’s (or none). 
Hence the given set is represented by b* ab*. 

(d) Here we have three cases: w contains no a, one a or two as. 
Arguing as in (c), the required regular expression is 

b* + b* ab* + b* ab* ab* 

(e) aa(aaa)* 

(f) (aa)* + (aaa)* + aaaaa 

(g) a(a + b)* a 

5.2 (a) The strings of length at most 4 in (ab + a)* are A, a , ab, aa, 

aab, aba , abab, cr\ aaba , aaab and a 4 . The strings in aa + b are 
aa and b. Concatenating (i) strings of length at most 3 from the 
first set and aa and (ii) strings of length 4 and b, we get the 
required stnngs. They are aa, aaa, abaa, aaad, aabaa, abaaa, 
a 5 , ababb, aabab, aaabb , and aaaab. 

(c) The strings in (ab + a) + (ab + a)“ are a , ab, aa, abab, aab and 
aha. The strings of length 5 or less in (ab + a) 3 are a 3 , abaa, 
aaab, ababa. The strings of length 5 or less in (ab + a) 4 are a 4 , 
a^ab, aba T In (ab + a) 5 , a 5 is the only string of length 5 or less. 
The strings in a* are in (ab + a)* as well. Hence the required 
strings are A, a, ab, or, abab, aab, aba, a~\ abaa, aaab, ababa, 
a 4 , aaaab, abaaa, and a 5 . 

5.3 (a) The set of all strings starting with a and ending in ab . 

(b) The strings are either strings of a s followed by one b or strings 

of £>’s followed by one a . 

(c) The set of all strings of the form vvc where a s occur in pairs in 
v and b’s occur in pairs in w. 

5.5 The transition system equivalent to (ab + a)*(aa + b) (5.2(a)) is 
given in Fig. A5.L 



Solutions (or Hints) to Chapter-end Exercises 



Fig. A5.1 Transition system for Exercise 5.5. 


5.6 The transition system equivalent to a(a + b)*ab (5.3)(a)) 
Fig. A5.2. 
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5.7 (a) 0; (b) (a + £>)*; (c) the set of all strings over { a , Z?} containing 

two successive <z’s or two successive V s; (d) the set of all strings 
containing even number of a "s and even number of b's. 

5.8 We get the following equations: 

q 0 = A 

qi = q«i + qii + q.a 

q 2 = qjO + q 2 0 + q 3 0 
qs = qii 

Therefore, 

q 2 = qiO + q 2 0 + q 2 10 = q^ + q 2 (0 + 10) 

Applying Theorem 5.1, we get 

q 2 = qi0(0 + 10)* 

Now, 

q x = 1 + q x l + q 2 H = 1 + qj + qi0(0 + 10)*11 

= 1 + qj(l + 0(0 + 10)* 11) 

By Theorem 5.1, we get 

q x = 1(1 + 0(0 + 10)*11)* 

q 3 = q 2 l = 1(1 + 0(0 + 10)11)* 0(0 + 10)*1 

As q 3 is the only final state, the regular expression corresponding to 
the given diagram is 1(1 + 0(0 + 10)* 11)* 0 (0 + 10)* 1. 

5.9 The transition system corresponding to (ab + c*)*b is given in Fig. 
A5.3. 

5.10 (a) The required regular expression 

(1 + 01 )* + (1 + 01 )* 00(1 + 01 )* + (0 + 10 )* + (0 + 10 )* 11(0 

+ 10)* (1 + 01)* represents the set of all strings containing no pair 

of 0 J s. (1 + 01)* 00 (1 + 01)* represents the set of all strings 
containing exactly one pair of 0’s. The remaining two expressions 
correspond to a pair of l’s. 

(c) Let w be in the given set L. If w has n a y s then it is in a set 
represented by (b)*. If w has only one a then it is in a set represented 
by b*ab*. If w has more than one a , write w = wiaw 2 , where w does 
not contain any a. Then wq is in a set represented by (b + abb)*. So 
the given set is represented by the regular expression b* + (b + abb)* 
ab*. (Note that the regular set corresponding to b* is a subset of the 
set corresponding to (b + abb)* 

(d) (0 + 1)* 000 (0 + 1)* 

(e) 00(0 + 1)* 

(f) 1(0 + 1)* 00. 
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Fig. A5.3 Transition system for Exercise 5.9. 


5.12 The corresponding regular expression is (0 + 1)* (010 + 0010). We 
can construct the transition system with A-moves. Eliminating 
A-moves, we get the NDFA accepting the given set of strings. The 
NDFA is described by Fig. A5.4. 




Fig. ASA NDFA for Exercise 5.12. 


The equivalent DFA is defined by Table A5.1. 
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TABLE A5.1 State Table of DFA for Exercise 5.12 


State! I 


0 

1 

tool 


too, <7i, < 73 ] 

tool 

too, <7i. <731 


too, <7i, < 73 , < 74 ] 

too. < 72 ] 

too, q-A 


too, <7i, < 73 , Qf] 

too] 

too. <7i, <73, < 74 ] 


[<7o, <7i. <73, %] 

too, <72, < 75 ] 

too, <7i, < 73 , <7f] 


too, <7i, <73, <74] 

too, < 72 ] 

too, <72, <75] 


too, <7i, <7s, <7f] 

tool 

Similar to Exercise 

3.10. 



The state table for the NDFA accepting (a + b)* abb is defined by 

Table A5.2. 




TABLE A5.2 

State Table for Exercise 5.14 

State/2, 


a 

b 

9o 


9o. 9i 

9o 

9i 



92 

92 



Qf 

Qf 




The corresponding 

DFA is 

defined by Table A5.3. 

TABLE A5.3 

State Table of DFA for Exercise 5.14 

Stated 


a 

b 

tool 


too, <7il 

tool 

too, <7il 


too, <7il 

too, < 72 ] 

too, < 72 ! 


too, <7i] 

[<7o, Qf] 

too, Qf\ 


too, <7i] 

too, <7f] 


5.15 Let L be the set of all palindromes over { a , b}. Suppose it is accepted 
by a finite automaton M = (Q, X, 5, q , F), {<5(g 0 , d l ) | n > 1} is a 
subset of Q and hence finite. So 8(q& a n ) = 8(q 0 , a m ) for some 
m and n , m < n. As a n b 2n a n e L, 5(<7 0 , a n h 2n ci u ) e F. But S(q 0 , a m ) 
= 8{q^ a Hence 8(q 0 , d n b 2n a tl ) = S(q 0i o Il b 2n a n ), which means 
a m b~ n a n e L. This is a contradiction since a m b 2}1 a n is not a palindrome 
(remember m < n). Hence L is not accepted by a finite automaton. 

5.16 The proof is by contradiction. Let L = {a n b n \ n > 0}. {8(q 0 , a n ) | 
n > 0} is a subset of Q and hence finite. So 8(q 0 , a n ) = (5(g 0 , 

for some m and n, m ^ n. So <5(g 0 , = <5(<5(<7 0? ft”) = 

S(8(q 0 , < 2 "), fr 77 ) = 5(<7 0 , As <T 7 & /7 e L, <5(g 0 , a 77 & 77 ) is a final state 
and so is 8(q 0 , a m b n ). This means d n h n s L with m & n, which is a 
contradiction. Hence L is not regular 
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5.17 (a) Let L = {a n b 2u \n > 0}. We prove L is not regular by 
contradiction. If L is regular, we can apply the pumping lemma. Let 
n be the number of states. Let w = a n b 2n . By pumping lemma, 
w - xyz with \xy\ < n, \y\ >0 and xz e L. As \xy \ < n, xy = a m and 
y = a 1 where 0 < 1 < n. So xz = a n ~ [ b 2n e L, a contradiction since 
n - 1 & n. Thus L is not regular. 

(b) Let L = {a u b m | 0 < n < mj. We show that L is not regular. Let 
n be the number of states and w = a n b m , where m > n. As in (a), 
y = a\ where 0 < 1 < n. By pumping lemma xy k z e L for k > 0. So 
a n ~ l a! k b m e L for all k > 0. For sufficiently large k, n - 1 + Ik > m. 
This is a contradiction. Hence L is not regular. 

5.19 Let M = (<2, X, <5, q 0 , F) be a DFA accepting a nonempty language. 
Then there exists w = a { a 2 . . . a p accepted by M. If p < n , the result 
is true. Suppose p > n. Let 8(q 0 , a x a 2 . . . a } ) = q { for i = 1, 2, . . .,p. 
As p > n, the sequence of states {q x , q 2 , . . q p } must have a pair 
of repeated states. Take the first pair (q p q k ) (Note q } - q k ). Then 
8(q 0 , a x a 2 .. aj) = q p 8(q p a j+l . . . a k ) = q, and 8(q p a M .. 
a p ) e F. So 8(q 0 , a x a 2 . . . aja k+l . . . a p ) = S(q p a x a 2 . .a p ) e F. 
Thus we have found a string in X* whose length is less than p (and 
differs from \w\ by k - j). Repeating the process, we get a string of 
length m, where m < n. 

5.20 Let M = ({ q 0 , q h q f }, {a, b), 5, q 0 , {q/}), where q 0 and q { correspond 
to S and A respectively. 

Then the NDFA accepting L(G) is defined by Table A5.4. 



Table 

A5.4 Table for Exercise 

5.20 

StateFL 

a 

b 


Qo 

Qo 

Vh Qf 


<7i 

<7i, Qf 

Qo 

The transitions are: 


8(q h a) 

= 

8{q 2 , b ) = q x 


8{q i, b) 

= 

8{q A , a) = q x 


S{q 2 , a) 

II 

u> 



8(q 3 , a ) 

= qi 



S(q 3 , b) 

= <74 



8(q 4 , b) 

= 



Let A ls 

a 2 , a 3 , 

A 4 correspond to q x . 

q 2 , < 73 , ^ 4 . The induced 

productions are A 

[ —^ ^A 4 , Aj —^ A 2 

—^ flA 3 , A 3 —^ tzA 2 , A 3 —^ 
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bA 4 , A 4 —> Z?A 3 (corresponding to the first six transitions) and A 2 —> 
bAi, A 2 —> Z>, A 4 —» aA b A 4 —> a (corresponding to the last two 
transitions). So G = ({A 1? A 2 , A 3 , A 4 }, {a, b }. P. A]), where P 
consists of the induced productions given above. 

5.22 The required productions are: 

5 -> AS^BSxl . .. |ZS if 
S x AS^Sj] ... |ZS l9 
5* -> 051I15J| ... |95; 

Si -> A|5| ... fZ and S x 0|1| ... |9 

5.24 The given grammar is equivalent to G = ({S, A, B}, {a , Z?}, P, S), 
where P consists of S —> aS | Z?S | aA , A —» Z?P, P —> a(P -» aC and 
C —» A is replaced by B —> a). Let q Q , q x and q 2 correspond to S, A 
and B . q f is the only final state. The transition system M accepting 
L(G ) is given as 

M = ({ q Q , q u q 2 , q,}- {a, b}, <5, q 0 , {q f }) 

where q 0 , q u and q 2 correspond to S, A and B and q f is the (only) 
final state. S —> aS, S ^ bS, S aA and A -» bB induce transitions 
from q 0 to q 0 with, labels b and a, from q 0 —> q x with label a and 
from q x to q 2 with label b. B -» a induces a transition from q 2 to qf 
with label a. M is defined by Fig. A5.5. 


a, b 



Fig A5.5 Transition system for Exercise 5.24. 


The equivalent DFA is given in Table A5.5. 


TABLE A5.5 State Table of DFA for Exercise 5.24 


State /I 

a 

b 

[Qo] 

[qo. <7il 

tool 

too, <7i] 

too. <7il 

too. %] 

too. < 72 ] 

too. <7i. Qf] 

tool 

too. <7i. Qt] 

[Qo. <7i] 

too. < 72 ] 
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Chapter 6 

6.1. The derivation tree is given in Fig. A6.1. 



6.2. 5 050, 5 —» 151 and 5 —> A give the derivation 5 => wAw , where 

w € {0, 1} + . A -> 253 and 5 -> 253 give A => 2 m 53 m . Finally, 
5^3 gives 5 => 3. Hence 5 wAw \v2 m 53 m w => w2 m 3 m + I w. 
Thus, L(G) c {w2 m 3 m+l >v | w e {0, 1}" and m > 1}. The reverse 
inclusion can be proved similarly. 

6.3 (a) X b X 3 , X 5 ; (b) X 2 , X 4 ; (c) As X { = 5, X 4 X 2 and X 2 X 4 X 4 X 2 are 
sentential forms. 

6.4 (i) 5 => SbS => abS =» abSbS => ababS ababSbS => abababS => 

abababa. 

(ii) 5 => 5fc>5 SbSbS => SbSbSbS => SbSbSba => SbSbaba 
Sbababa => abababa 

(iii) 5 => 5Z?5 => aZ?5 => abSbS => abSbSbS => ababSbS => abababS 
=> abababa 

6.5 (a) 5 => a5 => aa55 => aaaBBB => aaabBB => aaabbB => 

aaabbaBB aaabbabB => aaabbabbS => aaabbabbbA ==> 
aaabbabbba 

(b) 5 =» aB => aa55 => aaBbS => aaBbbA => aaBbba => aaaBBbba 
=> aaabBbba => aaabbSbba => aaabbaBbba => aaabbabbba . 

6.6 aZ?aZ? has two different derivations 5 => abSb => abafc (using 
5 —> abSb and 5 —» a) 5 => aAZ? =^> abSb => aZ?afr (using 5 —» aAZ?, 
A —> Z?5 and 5 —> zz). 

6.7 ab has two different derivations. 

S =$ ab (using 5 —» ab) 

S aB =$> ab (using 5 —> aB and 5 —> fo). 
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6.8 Consider G = ({5, A, P}, { a, b\. P, 5), where P consists of 
S —^ AP | cib and P —> p. 

Step 1 When we apply Theorem 6.4, we get 

W x = {5}, W 2 = {5} u {A, B, a , P} = W 3 
Hence G\ = G. 

Step 2 When we apply Theorem 6.3, we obtain 

W] = {5, P}, W 2 = {5, P} u 0 = W 3 

So, G 2 = ({5, P}, {a, b} 9 {S -» aP, B -> 6}, 5). 

Obviously, G 2 is not a reduced grammar since P —> P does not appear 
in the course of derivation of any terminal string. 

6.9 Step 1 Applying Theorem 6.3, we have 

W x = {P}, W 2 = {B} u {C, A}, W 3 = {A, 5, C} u {5} = V* 
Hence G} = G. 

Step 2 Applying Theorem 6.4, we obtain 

W x = {5}, W 2 = {5} u {A, a}. W 3 = {5, A, a} u {5. P} 

W 4 = {5, A, 5, a, P} u 0 

Hence, G 2 = ({S, A, 5}, {< 3 , P}, P, 5), where P consists of S —> aAa , 
A -» bBB and P —> ah. 

6.10 The given grammar has no null productions. So we have to eliminate 
unit productions. This has already been done in Example 6.10. The 
resulting equivalent grammar is G = ({5, A, B , C, D , P}, { a , P}, P, 
S), where P consists of S —» AP, A —» a, P —> b \ a, C —> a, P> —> a 
and E a. Apply step 1 of Theorem 6.5. As every variable derives 
some terminal string, the resulting grammar is G itself. 

Now apply step 2 of Theorem 6.5. Then 
W x = {S}, W 2 = {5} u {A, P} = {S, A, P}, W 3 = {5, A, P} u 
{cl b} = {5. A, P, a, P} and W 4 = W 3 . 

Hence the reduced grammar is G f = ({5, A, P}, {a, P), P\ S ), where 
P / = {5 —> AP, A —> a, P —> P, P —» a}. 

6.11 We prove that by eliminating redundant symbols (using Theorem 6.3 
and Theorem 6.4) and then Unit productions, we may not get an 
equivalent grammar in the most simplified form. Consider the 
grammar G whose productions are S —> AP, A —> a. B —» C, P —> P, 
G —^ Z), D —^ P and E —^ 

Step 1 Using Theorem 6.3, we get 

W x = {A, P, P}, W 2 = {A, P, P}^ u {S, D}, 

W 3 = {5, A, P, D, P} u {C} = U Y - 

Hence G| = G. 

Step 2 Using Theorem 6.4, we obtain 

W x = {S}, W 2 = {5} u {A, B} 9 W 3 = {P, A, P} u {a, c, P}, 
W 4 = {5, A, P. c, P} u {D}, 

W 5 = {5, A, P, C, A a, P} u {P} = U v u X. 

Hence G 2 = G { - G. 
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Step 3 We eliminate unit productions. We then have 
W(S) = {5} W(A) = {A], W(E) = {E} W 0 (B) = { B }, 

Wm = {B} u {C}, W 2 {B) = {B, C} u {D}, 

W 3 (B) = { B , C, D, ?} = W(«) , MO ={CA£I, W(D) = {D, ?}. 
The productions in the new grammar G 3 are S —> AB, A a , 

B —> Z?, B —> a and B —> a. G 3 contains the redundant symbol B. So 
G 3 is not the equivalent grammar in the most simplified form. 

6.12 (a) As there are no null productions or unit productions, we can 
proceed to step 2. 

Step 2 Let Gj = (V^, {0, 1}, P u 5), where ?! and V' v are 
constructed as follows: 

(i) A -» 0, B —> 1 are included in P x . 

(ii) 5 —» 1A, B —> 15 give rise to S —» QA, 5 —» QS and C\ —> 1. 

(iii) S —> OB , A —> OS give rise to S —» QB, A —» C 0 S and C 0 —»0. 

(iv) A —> 1AA, B 05? give rise to A -» CiAA and B —> Q>BB. 
V* = {5, A, B, C 0 , O}. 

Step 3 G 2 = (Vy, (0. 1}, P 2 , 5), where P 2 and are constructed 
as follows: 

(i) A —> 0, B —> 1, S —> CiA, B —> C]5, C[ —> 1, S ^ CqB, 

A C 0 5, C 0 —» 0 are included in P 2 . 

(ii) A -> C X AA and B —> CqBB are replaced by A —» QD^ D { —> 
AA, B —> CqD 2 . ^2 —^ BB. 

Thus, G 2 = ({5, A, B, C 0 , C b D 1? D 2 }, {0, 1}, P 2 , 5) is in CNF and 
equivalent to the given grammar where P 2 consists of 5 —> CiA|CoB, 
A OlCo^lCiDi, B -> llQSlCoDj, Ci -> 1, C 0 -> 0. Dj -> AA and 
D 2 -» BB. 

(b) Step 2 Gi = (V'v, {a, Z>, c}, P 1? 5), where Pi and V' N are 
defined as follows: 

(i) S —> a, S —» b are included in ?! 

(ii) S —» cSS is replaced by S —» CSS, C —> c, V# = {S, C} 

Step 3 Go = (Fv, {a, Z?, c}, P 2 , S), where P 2 is defined as follows: 

(i) S 7 a. S ^ Z?, C ^ c are included in P 2 . 

(ii) S —» CSS is replaced by S CD and D —» SS. 

Thus, the equivalent grammar in CNF is Go = ({5, C, D}, {a, B, c}, 
Po, 5), where P 2 consists of S —> a\b\ CD, C —> c, D —> SS. 

6.13 Consider G = ({5}, {a. B, +, *}, P, S), where P consists of 5 —» S + 5, 
S —^ S * S, S —^ <3, s —^ b. 

Step 2 Gi = (F.y, {< 2 , b , +, *}, P 1? 5), where P L is constructed as 
follows: 

(i) S —> a, S —> B are included in P 1? 

(ii) S ^ S + S and 5 —» 5 * S are replaced by S —> SAS, S —» SBS, 
A —^ +, B —^ * 

Fy = {S, A, B) 
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Step 3 G 2 = (Vjy, {a, 5, +, *}. P 2 , *S) ? where P 2 is constructed as 
follows: 

(i) S —» a, 5 A, A —> + and 5 —> * are included in P 2 . 

(ii) S —» 5A5 and 5 —» 555 give rise to 5 —» SA b A x —> AS, 
5 -4 SB h B { 55. 

The required grammar in CNF is 

G 2 = ({5, A, 5, Aj, 5j}, {a, 6 , +, *}, P 2 , 5) 
where P 2 consists of 

5 -> a\b\SA { \SB h A B ->*, A x AS and B { 55 

6.14 (a) Rename 5 as A h By Remark following Theorem 6.9, it is enough 
to replace terminals by new variables to get an equivalent grammar 
Gj. Now, G\ is defined as 

Gi = «A lf A 2 , A 3 }, {0, 1}, P b Aj) 
where 5] consists of 

A | ^ AiA 1 |A 2 A 1 A 3 |A 2 A 3 , A 2 —^ 0 and A 3 —> 1 
This completes step 1. 

Step 2 All productions of G { except A { ~~>A X A { are in proper form. 
Applying Lemma 6.2 to Aj —> A [A], we get a new variable Z t and 
new productions A x A 2 A 1 A 3 Z 1 1 A 2 A 3 Zj, Zj —> A], Zj AjZj. The 
new grammar is 

G 2 = ({Aj, A 2 , A 3 , Zj}, {a, 5}, P 2 , AO 
where P 2 consists of 

Ai —> A 2 A[A 3 1A 2 A 3 1 A 2 A[A 3 Z[ | A 2 A 3 Z[ 

Z[ —> AjZj, Zj —^ A|, A 2 —> 0 and A 3 —^ 1 

Step 3 As A 3 -productions and A 2 - productions are in proper form we 
have to modify only the Aj-productions using Lemma 6.1. So the 
modified A [-productions are 

A[ OA 1 A 3 |OA 3 |OA 1 A 3 Z 1 |OA 3 Z 1 

Step 4 The productions Z x A x and Z\ —> A X Z X are modified using 
Lemma 6.1. They are: 

Z[ —^ 0A[A 3 | 0A 3 j 0A[A 3 Z[ j 0A 3 Z[ 

Z[ —> 0A[A 3 Z[ [ 0A 3 Z[ | 0A 1 A 3 Z[Z 1 | 0A 3 Z]Z[ 

Thus the required equivalent grammar in GNF is 

G 3 = ({A[, A 2 , A 3 , Z[}, {0. 1}. P 3 , A[), where P x consists of 
A[ —^ 0A[A 3 [ 0A 3 [ 0A[A 3 Z[ j 0A 3 Z[ 

A 2 —> 0, A 3 —> 1 

Z[ -> 0A[A 3 1 0A 3 1 0A[A 3 Z[ I 0A 3 Z[ 

Zj -> 0AjA 3 Z[ I 0A 3 Z[ I 0A 1 A 3 Z[Z[ I 0A 3 Z[Z[ 
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(b) Step 1 Replace B —> aSb by B -4 aSC and C -4 b. Rename 5, 
A, B and C by A u A 2 , A 3 and A 4 . The resulting grammar is 

G\ = ({A 1? A 2 , A 3j A 4 }, {a, B}, P b AO 

where P { consists of A\ -4 A 2 A 3 . A 2 -4 Ay4 3 

A 2 —^ A 3 A[A 3 , A 2 —4 B, A 3 —> aA 2 A 4 , A 3 —4 r/ and A 4 —4 b. 

Steps 2, 3 and 4 Step 2 construction is not necessary for G } . 
The only A 4 -production, A 4 -4 b, is in proper form. So we go to step 
4. The modified A 2 -productions are: 

At —4 aA 2 A 4 A 3 j rzA 3 j ^A 2 A 4 A[A 3 | &A|A 3 { B. 

The modified A t -productions are: 

Aj —> aA 2 A 4 A3A 3 1 0A3A3 | ^A 2 A 4 A|A 3 A 3 1 #A[A3A 3 | BA 3 . 

Step 5 is not necessary since there is no new variable in the form in 
Zj. So an equivalent grammar in GNF is 

G 2 — ({A]. A 2 . A 3 , A 4 }. {ci, B}, P 2 , A{) 

where P 2 consists of 

A! -4 aA 2 A 4 Ay\ 2 | aA 3 A 3 | aA 2 A 4 A 1 A 3 A 3 | aA { Ay \ 3 | BA 3 
At —4 ^A 2 A 4 A 3 j aA 3 j aA 2 A 4 AiA 3 j 6tA]A 3 | B 

A 3 —4 tfA 2 A 4 j ci. A 4 —4 b. 

6.15 The grammar given in Exercise 6.7 has the following productions: 

S —4 aB. S —4 ab , A —4 ciAB. A —4 u, B —4 ABb and B —4 b . Of these, 
S -4 aB. A -4 aAB, A —> a and B -4 B are in the required form. So 
we replace the terminals which appear in the second and subsequent 
places of the R.H.S. of S -4 ab and B -4- ABb by a new variable C 
and add a production C —4 b. Renaming S, B. A and C as A\, A 2 , A 3 
and A 4 , the modified productions turn out to be A\ —4 aA 2 , Ai -4 aA 4 , 
A 3 —4 ctA 2 A 2 '! A 3 —4 m A 2 —4 A 3 A 2 A 4 , A 2 —4 b and A 4 —4 B. 

This completes the first three steps: 

Step 4 A 2 -4 A 3 A 2 A 4 is replaced by A 2 —4 aA 3 A 2 A 2 A 4 | aA 2 A 4 . The 
other productions are in the proper form. The resulting grammar in 
GNF has the productions 

A { —4 ciA 2 | A 4 , A 2 —4 ^A 3 A 2 A 2 A 4 [ #A 2 A 4 1 B, A 3 —4 tzA 3 A 2 |^7, A 4 —4 b. 

The grammar in Exercise 6.10 has unit productions. Eliminating unit 
productions, we get an equivalent grammar G, where 

G = ({S, A. B. C, D. E}, {m B}, P. 5) 

where P consists of S -4 AB, A -4 a, B —4 a | B, C —4 a, Z) —» a 
and E -4 m Rename S, A, B. C, D and E as A 1? A 2 , A 3 , A 4 , A 5 and 
A 6 . 
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Thus P consists of A\ —^ A 2 A 3 , A 2 —^ ci , A 3 —> 22 j b , A 4 —^ < 2 , A 5 —^ <2 
and A^ —^ ci . 

We have to modify only A t —» A 2 A 3 using Lemma 6.1. 

Thus an equivalent grammar in GNF has the following productions: 

A i —> aA 3 , A 2 —^ 22 . A 3 —^ ci | b , A 4 —^ a, A 5 —> a and A 5 —> ci . 

6.16. (a) The given language is generated by a grammar whose productions 
are S —> aSa \ bSb \ c. 

Step 2 (i) S —» c is in Pi 

(ii) P -» aPa and 5 Z?PZ? give rise to 

S —> AiSA, P —> BSB , A —> 22 . P —> Z? in Pi 

Thus Gj = ({P, A, 5}, { a , Z?, c}, P b 5), where Pi consists of 

P —^ ASA | BSB J c , A —> <2 and P —^ Z?. 

Step 3 The equivalent grammar G 2 in CNF is defined by 
G 2 = ({5, A, B . A b PJ, {a, Z?, c }. P 2 , P), where P 2 consists of 

5 —> AA b A] —> PA, P —> PP b Pi —^ SP, A —> a, P —> Z>, P —> c. 

(b) The grammar generating the given set is having the productions 
S —> bA | aB, A —> Z?AA | aP | a, P —> aBB \bS\b. 

Step 2 The productions obtained in this step are: 

S —^ P[A, Pi —^ Z?, P —^ A]P, Ai —> ci. A —> PjAA, A —^ AiP, 

A —^ P —^ A\BB , B —> P}P, P —^ b. 

Step 3 The equivalent grammar in CNF is given by 
G 2 = (V%, {a, Z>}, P 2 , 5), where P 2 consists of 

(i) P —^ B\A , P[ —^ Z?, P —> A[P, A] —^ A —> A[P, A —> 22 , 

P —^ PiP, P —^ b 

(ii) A -» P]Ci, Ci —> AA, P —» A } C 2 , C 2 —» PP (corresponding to 
A —> PjAA and P —> AiPP). 

6.17 (a) The grammar generating the given language has the productions 
S —> aSa , P —» Z?PZ>, P c. The first two productions will be in GNF 
if the last symbols on R.H.S. are variables. Hence P —» aSa , P Z?PZ? 
can be replaced by P —» aPA, A —> a, P -» hPP. P —» b. Hence 
G' = ({P, A, P}, {< 2 , b }, P', P), where P / consists of P ^ aPA | Z?PP, 
P —> c, A —> a, P —> b is in GNF and is generating the given 
language. 

(b) The given language is generated by 

G = ({P, A, P}, {«, Z?}, P, P), where P consists of 
S hA\aB, A bAA \aS\a. B aBB \bS\b . This itself is in GNF. 

(c) The given language is generated by a grammar whose productions 

are P —> aAb, S —> aA, A —> aA, A —> a, S —» a, P —> Z?P, P —» b 

and P —» b. Of these productions we have to modify only one 
production namely, P —> aSb. This is done by replacing this 
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production by S —» aSB x , —» fe. (iVote: In this problem we can also 
replace S —»aSZ? by 5 —> aS5 alone. —> Z? is already in the grammar 
and there are no other 5-productions.) 

(d) The given language is generated by a grammar whose productions 
are S —» aSh , S —> cS. 5 —» c. The equivalent grammar in GNF is 

G = ({5, 5}, {a, 6 , c}, P , 5), 
where P consists of S —> aS5, 5 —» fo, 5 —» c5, S —> c. 

6.18 (i) Let w e L(G) and \w\ = k In the Chomsky normal form, each 
production yields one terminal or two variables but nothing else. For 
getting the terminals in w, we have to apply production of the form 
A —> a (k times). The corresponding string of variables, which is of 
length k can be obtained by k - 1 steps. (Each production A —» BC 
increases the number of variables by one.) So the total number of 
steps is 2k - 1. (The reader is advised to prove this result by induction 
on |w|.) 

(ii) When G is in GNF, the number of steps in the derivation of w 
is k(k - M). The number of terminals increases by 1 for each 
application of a production to a sentential form. Hence the number of 
steps in the derivation of w is k 

6.19 Step 1 Let n be the natural number obtained by applying pumping 
lemma. 

Step 2 Let ~ = a 11 *. Write z = uvwxy where 1 < \vx\ < n . (This is 
possible since \vwx\ < n by (ii) of pumping lemma.) Let \vx\ = m , 
m < n. By pumping lemma, uv 2 wx 2 y is in L. As \uv 2 wx 2 y\ > n 2 , 

| uv 2 wx 2 y\ = k 2 , where k > n + 1. But \uv 2 wx'y\ = n" + m < nr + 
2 n + 1. So |ziv 2 wx 2 > > | strictly lies between n 2 and (n + l) 2 which means 
uv 2 wx 2 y <£ L, a contradiction. Hence {a n "\ n > 1} is not context-free. 

6.20 (a) Take z - d l h n c n in L(G). Write z = uvwxy, where 1 < \vx\ < n. 
So vx cannot contain all the three symbols a , b and c. So uv 2 wx 2 y 
contain additional occurrences of two symbols (found in vx) and the 
number of occurrences of the third symbol remains the same. This 
means the number of occurrences of the three symbols in uv 2 wx 2 y are 
not the same and so uv^wxry £ L. This is a contradiction. Hence the 
language is not context-free. 

(b) As usual, n is the integer obtained from pumping lemma. 
Let z = d l b n c~ n . Then z — uvwxy , where 1 < |vx| < n. So vx cannot 
contain all the three symbols a, b and c. If vx contains only a’s and 
V s then we can choose i such that uv l wx l y has more than 2n 
occurrences of a (or b) and exactly 2 n occurrences of c. This means 
uv l wx l y £ L, a contradiction. In other cases too, we can get a 
contradiction by proper choice of i. Thus the given language is not 
context-free. 
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6.21 (a) Suppose G - (V N , X, P, S) is right-linear. A production of the 

form A —» aia 2 . . . a m B , m > 2 can be replaced by A —> ajA b 
A] —> a 2 A 2 • • A Z?]Z ? 2 . . . b m , m > 2, can be 

replaced by A —^ B\ —^ b 2 B 2 ^ »• ^m -2 —^ ’ P/n-i —^ 

b m . The required equivalent regular grammar G’ is defined by the new 
productions constructed above. 

If G = (Vtf, X, P, 5) is left-linear, then an equivalent right-linear 
grammar can be defined as G { = (V'v, X, Pi, 5), where P[ consists of 

(i) S -4 w when S —» w is in P and w e X*, 

(ii) S —> wA when A —> w is in P and w e X*, 

(iii) A —> wB when B —> Aw is in P and we X*, 

(iv) A —» w when S —» Aw is in P and w e X*. 

Let w e L(G). If S => w then S -» w is in P. Therefore, S w is 
in Pi (by (i)). 

Assume S => AjWj => A 2 w 2 wj ==>... A^w^i ... W] => vv ffI w m _j 
... wj = w is a derivation in G. Then the productions applied in the 
derivation are S AjWj, A\ —> A 2 w 2 , ..A m _i —> w m . The induced 
productions in Gj are 

A\ •—^ Wj, A 2 —^ w 2 A[, A 2 —^ wyL? . * S —^ w ;?? A 77 j_j 

(by (ii), (iii) and (iv)‘in the construction of Pj). 

Taking the productions in the reverse order we get a derivation of G\ 
as follows: 

S => =>■•«=> W m w ffl _ i ... W 3 A 2 => W m ... w 2 A[ => w m ...w { 

Thus L(G) c L(G'). The other inclusion can be proved in a similar 
way. So G is equivalent to a right-linear grammar G\ which is 
equivalent to a regular grammar. 

(b) Let G = ({5, A}, { a , b , c}, P, 5). where P consists of S 
Sc | Ac, A aAb\ab. G is linear (by the presence of A —» aAb). 

L(G) = {aW , |ro, n > 1} 

Using pumping lemma we prove that L(G) is not regular. Let n be the 
number of states in a finite automaton accepting L(G). 

Let w = d x b n c n . By pumping lemma w = xyz, where | xy j < n and 
\y | > 0. If y = cP then xz - a n ~ k b n d n . This is not in L(G). By pumping 
lemma, xz £ L(G) a contradiction. 

6.22 L - L(G), where G is a regular grammar. For every variable A in G. 
A => a implies a = uB , where w e X* and B e V, Thus G is nonself¬ 
embedding. To prove the sufficiency part, assume that G is a nonself¬ 
embedding, context-free grammar. If G' is reduced, in Greibach 
normal form and equivalent to G, then G' is also nonself-embedding. 
(This can be proved.) Let |X| = n and m be the maximum of the 
lengths of right-hand sides of productions in G\ Let a be any 
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sentential form. By considering leftmost derivations, we can show that 
the number of variables in a is < mn. (Use the fact that G / is in GNF). 
Define: 

G x = (V 4 v, 2, Pi, 5) where 

V' N = {[a]||a| < mn and a e Vtf} 

Si = [Si] 

P { = {[A/3] —> b[a f5] | A —> ba is in P, [5 e Vy and |a/3| < mn} 
G\ is regular. It can be verified that L(G\) = L(G'). 


Chapter 7 

7.1 (q 0 , aacaa , Zq) ]— ( 90 * acaa, aZ$) |— (, q 0 , caa, tftfZo) |— (q\, a , aaZ 0 ) 
|— (#i, a, ^Zq) |— (9 b A, Zq) |— ( 9 /> A, Zq). 

(i) Yes, the final ID is (gy, A, Zq). 

(ii) Yes, the final ID is (q h A, aZo). 

(iii) No, the pda halts at (q u ba , aZ 0 ). 

(iv) Yes, the final ID is (<y 1? A, abaZ 0 ) 

(v) Yes. the final ID is (q§. A, babaZ$). 

7.2 (i) ( 91 , A. aZ 0 ). 

(ii) Halts at ( 9 1; b. A). 

(iii) 0?o, A. a 5 Zo). 

(iv) Does not move. 

(v) Does not move. 

(vi) Halts at (q h ab . Zq). 

7.3 (a) Example 7.9. 

(b) The required pda A is defined as follows: 

A = (too* 9b 9 :}, K {a, Zq}, & 9 0 , Zq, 0)* <5 is defined by 

5(<7o, ^ Zq) = {(9b flZ 0 )}, 5(9b a, a) = {(q x , ad)} 

S(q \, 5, a) = {fe a)}, 5(<72, b, a) = {(q u A)} 

5(flfi, A, Zq) = {(9b A)}. 

(c) A = ({< 7 o 9ih {«> b, c}, {Zq, ZJ, <5, 9 0 , Zq, 0) 

<5 is defined by 

8 (q 0 , a, Zq) = {(9o* z iZo)h 5(g 0 , a, Z L ) = {(q Q: Z X Z X )} 

5(9o, 5, ZO - {(9u A)}, 5(9 b &, ZO = {( 91 , A)} 

5(9b c, Z 0 ) = {(9u Zo)}, 5(9i, A, Zq) = {(9b A)} 

Note that, on reading a , we add Z\\ on reading b we remove Zi and 
the state is changed. If the input is completely read and the stack 
symbol is Zq, then it is removed by a A-move. 

7.4 (a) Example 7.9 gives a pda accepting { a n b m a " | m, n > 1} by null 
store. Using Theorem 7.1, a pda B accepting the given language by 
final state is constructed. 

B = ({< 7 o, < 7 u <7o> <?/}> {«. b}. {a, Z 0 , Z' 0 }, S. q' 0 , Z' 0 , {q f \) 


——‘- 

i 
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S is defined by 

<5(4 0 , A, Z'q) = {(4o, ZoZ'o)} 

<5(4o, A, Z' 0 ) = {{q f , A)} = 5(4o, A, Z' 0 ) 

5 ( 4 ,, A, Z' 0 ) = {{qf, A)} = 8{q u A, Z' 0 ) 

5 ( 4 0 , a, Zq) = {( 4 0 , oZq)}, 8(q 0 , a, a) = {( q 0 , aa)} 

5 ( 4 q, 5, a) = {( 4 ,, a)}, 8{q h b , a) = {( 4 ,, a)} 

5 ( 4 ,, a, a) = {{qf A)}, 8{q u A, Zq) = {{q h A)} 

7.5 (a) G = ({5, S,, S 2 }, {d 5}, P, 5) generates the given language, 
where P consists of S —> 5,, S —» S 2 , 5, —» aS,5, 5, —> a5, S 2 —» 
aSibb, S 2 —> a55. The pda accepting L(G) by null store is 

A = ({ 4 }, {a, 5}, {5, S,, S 2 , a, b), 5, 4 , 5, 0 ) 
where 5 is defined by the following rules: 

5 ( 4 , A, S) = {( 4 , 5,), ( 4 , 5 2 )} 

5 ( 4 , A. 5,)= {( 4 - aSi, b ), ( 4 , ab)} 

5 ( 4 , A, SV) = {( 4 , aS 2 bb), ( 4 , aM>)} 

5(4, a, a) = 5 ( 4 . 5, 5) = {( 4 , A)} 

7.6 (i) G = ({5}, {a, 5}, P, 5), where P consists of 5 —> aS5, S —> aS, 

S a, generates 

{a"'5"|n < m}. For 5 => a"W. m > 0. 

5 => a", n > 1 , and hence 5 => d"a n b n \ m > 0, n > 1. 

So L{G) c {a m 5"jn < m). The other inclusion can be proved 
similarly. 

(ii) The pda A accepting L(G) by null store is given by 

A = ({ 4 }, {a, b), {5, a, b), 8, 4 , 5, 0) 
where 5 is defined by the following rules: 

5 ( 4 , A, S) = {( 4 , aSb), ( 4 , aS), ( 4 , a)} 

5 ( 4 , a, a) = 8{q , 5, 5) = {( 4 , A)} 

(iii) Define P = (Q\ X, T', 5 B , 40 , Z'q, F), where 

Q' = {< 7 o, < 7 o, 9 /}, r' = (S, a, 5, Z' 0 ) 

P = {. (We apply Theorem 7.1 to (ii)). 

8 b is given by 

Wo . A. Z'q) = {( 4 , ZoZ'q)} 

4(4, A, 5) = {( 4 , aS5), ( 4 , aS), ( 4 , a)} 

5 s ( 4 , a, a) = {( 4 , A)} = S B {q, b, b) 

S B {q, A, Z'q) = {( 4 ^ A)} 

{Note: S B {q, a, S) = 8{q, a, S) = 0 and 5 B ( 4 , 5, 5) = 5 ( 4 , 5, 5) = 0) 
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7.7 (i) Define G = ({5}, {a, b }, P, S ), where P consists of 

jS —> SaSbSaS , j5 —^ SaSaSbS , 5 —> SbSaSaS , 5 —> A 

(Refer to Exercise 4.10(e).) G is the required grammar, (ii) Apply 
Theorem 7.3. (iii) Apply Theorem 7.1 to the pda obtained in (ii). 

7.8 Let A = ({g 0 , gj, {a, &}, {Zq, a, b}, 5, g 0 , Zq, 0), where <5 is given 
by 

<5(<7o Zq) = {(go, aZq)}, <5(go, b, Zq) = {(go, bZo)} 

8 (q 0 , a, b) = {(go, ai)}, 5(g 0 , b, a) = {(g 0 , ba)} 

<5(g 0 , a, a) = {(go, aa), (gi, A)} 

<5(g 0 , b, b) = {(g 0 , bb), (g, A)} 

<5(g 0 , A, Zq) = {(g b A)} 

<5(g b a, a) = {(g b A)}, <5(g b b, b) = {(g b A)} 

<5(g b A, Zq) = {(g,. A)} 

A makes a guess whether it has reached the centre of the string. A 
reaches the centre only when the input symbol and the topmost 
symbol on PDS are the same. This explains the definition of 
<5(g 0 , a, a ) and <3(g 0 . b. b). A accepts the given set by null store. 

7.9 Example 7.6 gives a pda A accepting the given set by empty store. 
The only problem is that it is not deterministic. We have 8{q, a, Zq) 
= {(g, aZo)} and 8 (q, A, Z 0 ) = {(g, A)}. So A is not deterministic 
(refer to Definition 7.5). But the construction can be modified as 
follows: 

Ai = ({g. gj, {«, b). {Zq, a, b}, 8, g, Zq, 0) 
where 5 is defined by the following rules: 

<5(g, a , Zq) = {(g b aZ 0 )}, 8(q, b, Zq) = {(g b bZ o)} 

5(g b a, a) = {(g b aa)}, <5(g b b, b) = {(g b bb)} 

8(q u a, b) = {(g b A)}. <5(g b b, a) = {(g b A)} 

<5(g b A, Zo) = {(g b A)} 

A] is deterministic and accepts the given set by empty store. 

7.10 The S-productions are 

S —> [go, Zq, go] | [go, Zq. gi] 

8(q\, b, a) = {(g„ A)}. 8 (q h A, Zq) = {(g b A)} 
and 

8(q 0 , b, a) = {(g b A)} 

Now these induce [g b a, gj —> b. [g b Zo, gj] —* A and [g 0 , a, gj] 
—> b, respectively. <5(g 0 , a, Zq) = {(go, aZo)} induces 
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too, <7ol a too, <7o] [<7o^ Zo> <7o] 

too, Zq, %] —> a too, a ' 4\][<]u 2* <7o] 

too, .Zb, <7il ~ > a too, a > <7o]too, Zq, <?i] 

too* Zq, gj —> 4<7o, a ' <7i]tob Zq, < 7 j] 

5(<7o» <z, a) = {too, aa )} induces 
too, a ' <7ol «too, <7o]too, a » #o] 

too. <*, <?ol -* ^too, «, ^lltoi. «, %] 

too, 0 , «i] 4<?o> a, ^oltoo, a, 9 j] 

too, CL, q i] -> a[q 0 , a , tfJtoi, <*, ^7 i3 

7.13 Let M = (2, 5, <? 0 , F) be a DFA accepting a given regular set. 

Define a pda A by A = (Q, X, {Zq}. <5 b < 7 0 , Zq, F). 8 is given by the 
following rule: 

8 \(q, a , Zq) = {too, Zq)} if <?to, a) - 

It is easy to see that T(M) - T(A). Let w e T(M). Then <5 too, w) = 
q e F. 

5too, W, Zq) = {to', Zb)}. So w e F(A), i.e., F(M) c F(A). The 
proof that F(A) c F(M) is similar. 

7.15 If <Sto, a, z) contains (q\ Z X Z 2 . . - Z„), n > 3, we introduce new states 
<7b <72, • • •, < 7 n- 2 * We define new transitions involving new states as 
follows: 

(i) tob Z 7 j_ 1 Z„) is included in 8 (q, a , Z) 

Cii) (5(^ ; , A, Z„_,) = {(g ;+ i, Z„_;)} for i = 1, 2, .. n - 3 

(iii) <5(g„_ 2 , A, Z 2 ) = {( 9 ', Z,Z 2 )} 

This construction is repeated for every transition given by (q\ y) e 
8 {q, a , Z), |y| > 3. Deleting such transitions and adding the new 
transitions induced by them we get a pda which never adds more than 
one symbol at a time. 

Chapter 8 

8.1 For a sentential form such as a n+l b ! \ A —» a is the production applied 
in the last step only when a is followed by ah. So A —> a is a handle 
production if and only if the symbol to the right of a is scanned and 
found to be b. Similarly, A —» aAb is a handle production if and only 
if the symbol to the right of aAb is b. Also, S —» aAb is a handle 
production if and only if the symbol to the right of aAb is A. 
Therefore, the grammar is LR(1), but not LR(0). 

8.2 We can actually show that the given grammar is not LR(fc) for any 
k > 0. Suppose it is LR(fc) for some k. Consider the rightmost 
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derivations of 0l Zk+{ 2 and 01 2t+3 2 given by: 

S => 01 k A\ k 2 => 01 2;+1 2 = apw (A8.1) 

where a = 01*, ft = a, w = 1*2. 

5 ^ 01* +1 A1* +1 2 => 01 2 * +3 2 => a'l 3V (A8.2) 

/? /? 

where a' = 01* +1 , /T = a, W = 1* +1 2. As the strings formed by the 
first 2 k + 1 symbols (note \af}\ + k - 2k + 1) of afiw and a'fi'w' are 
the same, a - a\ i.e. 01* = 01* +1 , which is a contradiction. Thus the 
given grammar is not LR(&) for any k. 

8.3 The given grammar is ambiguous and hence is not LR(£) for any k. 
For example, there are two derivation trees for ab. 

8.4 As a n b n c n appears in both the sets, it admits two different derivation 
trees. So the set cannot be generated by an unambiguous grammar. 

Chapter 9 

9.2 The set of quintuples representing the TM consists of q { blLq 2 , 

q x 00Rq x , q 2 bbRq : „ q 2 00Lq 2 ' q$bRq$, q 3 lbRq 5 . q 4 bORq 5 , 

q 4 00Rq 4 , q 4 llRq 4 , q 5 bOLq 2 . 

9.3 The computation for the first symbol 1 is ^llMl f— bq 2 bll. 
Afterwards it halts. 

9.4 The computation sequence for the substring 12 of 1213 is 

q x 1213 |— &7 2 213 |— M^ 3 13. 

As S(q 2 , 1) is not defined, the TM halts. For 2133 and 312 the TM 
does not start. 

9.6 Modify the construction given in Example 9.7. 

9.8 We have the following steps for processing the even-length 
palindromes: 

(a) The Turing machine M scans the first symbol of the input tape 
(0 or 1), erases it and changes state (q x or q 2 ). 

(b) M scans the remaining part without changing the tape symbol 
until it encounters b. 

(c) The R/W head moves to the left. If the rightmost symbol tallies 
with the leftmost symbol (which can be erased but remembered), 
the rightmost symbol is erased. Otherwise M halts. 

(d) The RAV head moves to the left until b is encountered. 

Steps (a), (b). (c), (d) are repeated after changing the states suitably. 
The transition table is defined by Table A9.1. 
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TABLE A9.1 Transition Table for Exercise 9.8 


Present state 

0 

Input symbol 

1 

b 

9o 

bRq< 

bRq 2 

bRq 7 

9i 

0Rc?i 

IRqi 

bLqz 

92 

ORqi 

■\Rq 2 

bLq 4 

93 

bLq s 



94 


bi-q 6 


95 

0 Lq s 

1^5 

bRq 0 

96 

0Lq e 

1 Uk 

bRq 0 

© 





9.9 We have three states q Q , q h qj, where q 0 is the initial state used to 
remember that even number of l’s have been encountered so far. q\ is 
used to remember that odd number of l’s have been encountered so far. 
q f is the final state. The transition table is defined by Table A9.2. 


TABLE A9.2 Transition Table for Exercise 9.9 


Present state 

0 

1 

b 

9o 

ORqo 

IRC?! 

bRq f 

9i 

ORqi 

1Rq 0 


_a_ 





9.10 The construction given in Example 9.7 can be modified. As the number 
of occurrences of c is independent of that of a or b, after scanning the 
rightmost c , the R/W head can move to the left and erase c. 

9.11 Assume that the input tape has 0"T0 /Z where m - n is required. We 
have the following steps: 

(a) The leftmost 0 is replaced by b and the R/W head moves to the 
right. 

(b) The R/W head replaces the first 0 after 1 by 1 and moves to the 
left. On reaching the blank at the left end the cycle is repeated. 

(c) Once the 0’s to the left of l’s are exhausted. M replaces all 0’s and 
l’s by V s. a - b is the number of 0’s left over in the input tape 
and equal to 0. 

(d) Once the 0’s to the right of l’s are exhausted, n 0’s have been 
changed to l’s and n 4* 1 of m 0’s have been changed to b. M 
replaces l’s (there are n + 1 l’s) by one 0 and n V s. The number 
of 0’s remaining gives fhe values of a — b. The transition table is 
defined by Table A9.3. 
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TABLE A9.3 Transition Table for Exercise 9.11 


Present state 


Input symbol 



0 

1 

b 

Qo 

bRqi 

bRq s 


9i 

o Rq-I 

1 RQ2 


Q2 

1^3 

1Rq 2 

DLq A 

<?3 

0 Lq 3 

1^3 

bRq 0 

Qa 

0Lq 4 

bLq 4 

0 Rq e 

<75 

bRq s 

bRq 5 

bRq$ 


Chapter 10 

10.2 1. (. B , w) is an input to M. 

2. Convert B to an equivalent DFA A. 

3. Run the Turing machine M x for A dfa on input (A, w) 

4. If M x accepts, M accepts; otherwise M rejects 

10.3 Construct a TM M as follows: 

1. (A) is an input to M. 

2. Mark the initial state of A (q 0 marked as q% a new symbol). 

3. Repeat until no new states are marked: a new state is marked if 

there is a transition from a state already marked to the new state. 

4. If a final state is marked, M accepts (A); otherwise it rejects. 

10.4 Let L = ( T(A { ) - T(A 2 )) u (T(A 2 ) - T(Ai)). L is regular and L = T{A'). 

Apply E dva to (AO- j 

10.8 Use Examples 10.4 and 10.5. j 

10.9 A tm is regarding a given Turing machine accepting an input, that is, j 

reaching an accepting state after scanning w and halting HALT tm is 
regarding a given TM halting on an input (or M need not accept w in 

this case). I 

10.10 Represent a number between 0 and 1 as 0 • a\a 2 . . . where a\, a 2 , . . . j 

are binary digits. Assume the set to be a sequence, apply I 

diagonalization process and get a contradiction. | 

10.11 When a problem is undecidable, we can modify or take a particular case j 

of the problem and try for algorithms. Studying undecidable problems 

may kindle an imagination to get better ideas on computation. | 

i 

10.12 Suppose the problem is solvable. Then there is an algorithm to decide | 

whether a given terminal string w is in L. Let M be a TM. Then there 

is a grammar G such that L{G) is the same as the set accepted by M. f 

Then w e L(G) if and only if M halts on w. This means that the halting 
problem of TM is solvable, which is a contradiction. Hence the 
recursiveness of a type 0 grammar is unsolvable. | 
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10.14 Suppose there exists a Turing Machine M over {0, 1} and a state q m 
such that the problem of determining whether or not M will enter q m 
is unsolvable. Define a new Turing machine M' which simulates M and 
has the additional transition given by S(q m , A) = (q„ v 1, R ). Then M 
enters q m when it starts with a given tape configuration if and only if 
M' prints 1 when it starts with a given tape configuration. Hence the 
given problem is unsolvable. 

10.17 According to Church’s thesis we can construct a Turing Machine which 
can execute any given algorithm. Hence the given statement is false. 
(Of course, the Church’s thesis is not proved. But there is enough 
evidence to justify its acceptance.) 

10.18 Let X = {a}. Let x = (x x , x 2 , . . x n ) and y = (y b y 2i . . y n ), where 
Xf = a ki , i = 1, 2, . . n and yj = a l j , j - 1,2,..., n. Then (xi) Il (x 2 ) 12 
• . -{x n ) ln - (yx^iyif 2 • • - (y n ) kn - Both are equal to c^ kili . Hence PCP is 
solvable when |Z| = 1. 

10.20 Xi = 01, yi = Oil, x 2 = 1, y 2 = 10, x 3 = 1, y 3 = 1. Hence |^| < \y ( \ 
for i - 1 , 2, 3. So x ix x l2 . . . x im ^ y^y^ . . . y itn for no choice of V s. 
Note: \x il x i2 . . . x im | < | y /L y b . . . y,-J. Hence the PCP with the given 
lists has no solution. 

10.21 = 0, y x = 10 , .r 2 = 110, y 2 = ^3 = 001, y 3 = 10. Here no pair 

(x h y^, (jc 2 , y 2 ) or (jt 3 , y 3 ) has common nonempty initial substring. So 
x ix x i2 ... x im & >^> 7 2 .. . y im for no choice of if s. Hence the PCP with 
the given lists has solution. 

10.22 As x { = >'], the PCP with the given lists has a solution. 

10.23 In this problem, x\ = 1, x 2 = 10, x 3 = 1011, y x = 111, y 2 = 0, 
y 3 = 10. Then, xyx\x x x 2 = y 3 >’iyiy 2 = 101111110. Hence the PCP with 
the given lists has a solution. Repeating the sequence 3, 1, 1, 2, we can 
get more solutions. 

10.24 Both (a) and (b) are possible. One of them is possible by Church’s 
thesis. Find out which one? 

Chapter 11 

11.1 (a) The function is defined for all natural numbers divisible by 3. 

(b) x = 2 

(c) x > 2 

(d) all natural numbers 

(e) all natural numbers 

11.2 (a) X{ 0 j( 0 ) = 1 , X{ 0 }(x + 1 ) ~ Z( 0 }Sgn(M*)) 

(b) f{x + 1 ) = jr + 2x + 1 

So, f(x + 1) = fix) + S(S(Z(x))) * U{(x) + SiZix )) 
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Hence / is obtained by recursion and addition of primitive recursive 
functions 

(c) Ax, y) = y + (x - y) 

(d) Define parity function P,.(y) by 

/VO) = P,(2) = ■ ■ ■ = 0, P r ( 1) = PA 3) = • • ■ = 1 

P r is primitive recursive since P r { 0) = 0, Pfx + 1) = iUfix), 
P r (x)). Define / by /(0) = 0, fix + 1) = fix) + P r (x). 

(e) sgn(0) = Z(0), sgn(x + 1) = SiZ(Uf (x, sgn (x)») 

(f) L(x, y) = sgn(x ^ y) 

(g) Eix, y) = *<o)(C* - y) + (y - x)) 

All the functions (a)-(g) are obtained by applying composition and 
recursion to known primitive functions and hence primitive recursive 
functions. 

11.3 A(l, y) = Ail + 0, y - 1 + 1) = A(0, A(l, y - 1)) using (11.10) of 
Example 11.11. Using (11.8), we get 

A(l, y) = 1 + A(l, y - 1). 

Repeating the argument, we have 

A(l, y)=y - 1+A(1, l) = y + 2 (By Example 11.12, A(l, 1) = 3). 
This result is used in evaluating A(3, 1). 

A(2, 3) = A(1 + 1, 2 + 1) = A(l, A(2, 2)) = A(l, 7) using 
Example 11.12. Using A(l, y) = y + 2, we get A(2, 3) = 2 + 7 = 9. 
Then using (11.10), A(3, 1) = A(2 + 1, 0 + 1) = A(2, A(3, 0)) 

By Example 11.12, A(2, 1) = 5. Also, A(3, 0) = A(2, 1) by (11.9). 
Hence A(3, 1) = A(2, 5) = A(1 + 1, 4 + 1) = A(l, A(2, 4)). Since 
A(l, y) = y + 2, A(3, 1) = 2 + A(2, 4). Applying (11.10), we have 
A(2, 4) = A(l, A(2, 3)) = 2 + A(2, 3) = 2 + 9 = 11. 

Hence, A(3, 1) = 2 + 11 = 13. 

A(3, 2) = A(2, A(3, 1)) = A(2, 13) = A(l, A(2, 12)) = 2 + A(2, 12) 
= 2 + A(l, A (I, 11)) = 2 + A(l, 13) = 2 + 2 + 13 = 17. 

To evaluate A(3, 3), we prove A(2, y + 1) = 2y + A(2, 1). Now, 

A(2, y + 1) = A(1 + 1, y + 1) = A(l, A(2, y) = 2 + A(2, y). 

Repeating this argument, A(2, y + 1) = 2y + A(2, 1). Now, 

A(3, 3) = A(2 + 1, 2 + 1) = A(2, A(3, 2)) = A(2, 17) = 2(16) + 
A(2, 1) = 32 + 5 = 37. 
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11.4 (b) It is clear that r(x, 0) = 0. Also, r(x, y) increases by 1 when y is 
increased by 1 and r(x, y) = 0 when y = x. Using these observations we 
see that r(x, y + 1) = S(r (x, y)) * sgn(x - S(r (x, y))). Hence 
r(x , y) is 1 defined by 

r{x, 0) = 0 

r(x, y + 1) = S(r(x. y)) * sgn(x - S(r(x, y))) 

11.5 f{x) is the smallest value of y for which (y + l) 2 > x. Therefore, 
J{x) = My(X[0}((y + l) 2 “ *))» / is partial recursive since it is obtained 
from primitive recursive functions by application of minimization. 

11.8 The constant function fix) = 1 is primitive recursive for /(0) = 1 and 
fix + 1) = U 2 2 (x, fix)). Now Xac\B and Xaub are recursive for 
Xac = 1 Za> Xadb = Xa * Xb and Xaub = Xa + Xb ^ Xa rs 
(Addition and proper subtraction are primitive recursive functions and 
the given functions are obtained from recursive functions using 
composition.) 

11.9 Let E denote the set of all even numbers. ^(0) = 0, %fin + 1) = 
1 - $gn(U 2 (n, Xe( h ))' The sign function and proper subtraction 
function are primitive recursive. Thus E is obtained from primitive 
recursive functions using recursion. Hence % E is primitive recursive and 
hence recursive. To prove the other part use Exercise 11.8. 

11.11 Define / by fiO) = k, fin + 1) = Uf(n, fin)). Hence / is primitive 
recursive. 

1H2 X{a h a 2 ,...,a n } = X[a x \ + X{a 2 } + As X{a x \ « primitive 

recursive. (Refer to Exercise 11.2(a)) and the sum of primitive 
recursive functions is primitive recursive, X{a b a% . a n } i s primitive 
recursive. 

11.13 Represent x and y in tally notation. Using Example 9.6 we can compute 
concatenation of strings representing x and y which is precisely x + y 
in tally notation. 

11.14 Let M in the Post notation have {q u q 2 , .. q n } and {a b a 2 , . . ., a m } 
as Q and Z respectively. Let Q> - {q h .. ., q n , q n+{ , . . q 2n }, where 
q n+ 1 . . q 2n are new states. Let a quadruple of the form qjCijRq k induce 
the quintuple qfLfx k Rq k . Let a quadruple of the form q\CifLq k induce the 
quintuple q i ajajLq k . Finally, let q^a^i induce qiOja k Rq n+i . We 
introduce quintuples q )l+ iaja t Lqi for i = 1, 2, .. ., n and t = 1, 2, 3, 
. . ., m. The required TM has Q' as the set of states and the set of 
quintuples represent 8 . 
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11.15 q 0 llllx { by p- llllq^by j— llllq { xby. As b lies between x { and y, 
Z(4) = 0 (given by b ). 

11.16 In Section 11.4.5 we obtained q^lx^by p- q 5 lx { My. Similarly, 
q§\l\x x by p- q 5 ll\x { bly |— lq$\\x\b\y. Proceeding farther, 
lc/plU^My p- lq 5 llxiblly p-^* llq 0 lxiblly p- Hl^^llHy (as in 
Section 11.4.5). Hence 5(3) - 4. 

11.18 Represent the argument x in tally notation. J{x) = 5(S(x)). Using the 
construction given in Section 11.4.7, we can construct a TM which 
gives the value 5(S(x)). 

11.19 fix \, X 2 ) = S(S(U 2 (x h %>)). Use the construction in Section 11.4.7. 

11.20 Represent (x { , x 2 ) by T V M A -. By taking the input as $l Vl M A2 ($ is 
representing the left-end) and suitably modifying the TM given in 
Example 9.6, we get the value of x l + x 2 to the right of $. 

Chapter 12 

k l 

12.1 Denote f(n) = X an 1 and g(«) = X bid, where a h b h are positive 

/=o " /—0 ' 

k 

integers. Assume k > 1. Then f(n) + g(n) = X (a,- 4- b t )n\ where bj = 0 

i=0 

for i > l f{n) + g(n) is a polynomial of degree k. Hence f(n)g(ri) = 
0(n k+i ). 

12.2 As rr dominates n log n and n 2 log n dominates n 2 , the growth rate of 
h(ri) > growth rate of g{n). Note j\n) = g(n) = 0(n 2 ). 

n n n 

12.3 As X i = n(n + l)/2, X r = n(n + l)(2n + l)/6 and X r = (n(n + l)/2) 2 , 

i =0 i=0 o i=0 

the answers for (i) and (ii) are 0(n 3 ) and 0(n 2 ). (iii) a( 1 - r 11 )!! - r- 

0() J1 /r) = 0(P 1 “ 1 ). (iv) " [2a + (n-i)d] = 0(n 2 ). 

12.4 As log 2 n, log 3 n, log e n, differ by a constant factor, fin) = <9(m log/t) 

12.5 gcd = 3. 

12.6 The principal disjunctive normal form of the boolean expression has 
5 terms (refer to Example 1.13). P a Q a R is one such term. So 
(T, E T) satisfies the given expression. Similar assignments for the 
other four terms. 

12.7 No. 

12.8 (F, F, F, F) makes the given expression satisfiable. 

12.9 Only if: Take an AF-complete problem L. Then L is in CO-NP = NP. 
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if: Let P be A'P-complete and p e NP. Let L be any language in NP. 
We get a polynomial reduction (j) of L to P and hence a polynomial 
reduction yr of L to p . We prove NP c CO-NP. Combine yr and 
nond'eterministic polynomial-time algorithms for p to get a 
nondeterministic polynomial-time algorithm for L '. So L e NP or L 
e CO-NP. This proves NP c CO-NP. The other inclusion is similar. 
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